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(3) Non-A non B hepatitis-specific antigen and Its use In hepatitus diagnosis. 

@ This invention relates to a DNA fragment comprising a base sequence encoding a non-A non-B 
hepatitis-specific antigen polypeptide, said base sequence being obtained using genetic engineering 
techniques from non-A non-B hepatitis virus RNA which is isolated directly from blood plasma from 
non-A non-B hepatitis patients, to an expression vector and a transformant for use in the expression of 
the DNA fragment, to a single strand DNA sequence for PCR primer, and to use of said polypeptide and 
said single strand DNA sequence in the detection of the non-A non-B hepatitis virus. The recombinant 
polypeptide and the single strand DNA sequence for PCR primer make it possible to detect the non-A 
non-B hepatitis virus with extremely high accuracy. 
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The present invention relates to a novel DNA fragment encoding non^A non-B hepatitis-specific antigen 
polypeptide which is found at the time of infection or onset of the non-A non-B hepatitis. 

It also relates to an expression vector containing said DNA fragment to a transformant transformed with 
said expression vector and to an expressed polypeptide obtained by culturing said transformant 
5 It further relates to a single strand DNA sequence for PCR primer synthesized on the basis of a partial base 

sequence of said DNA fragment 

It also relates to the use of said expressed polypeptide and said single strand DNA sequence in detection 
of the non-A non-B hepatitis virus. 

10 BACKGROUND OF THE INVENTION 

Non-A non-B hepatitis is an infectious disease which is caused by a masked virus other than A and B hepati- 
tis viruses, but it is not easy to identify the virus because an amount of the virus-specific antigens is very small 
in a patient's body as well as of anti-virus antibodies. Accordingly, diagnosis of non-A non-B hepatitis has been 

is made serologically by the well-known "diagnosis by exclusion" method wherein increase in the levels of alanine 
aminotransferase and aspartate aminotransferase is determined for a serum from a patient to make a diagnosis 
whether or not the hepatitis belongs to any of hepatitis A, hepatitis B f hepatitis D and other hepatitis symptoms 
due to the known hepatopathy-causing viruses such as CMV, EBV, etc, and if the result of diagnosis are not 
applicable to them, then a case is identified as non-A non-B hepatitis. It however, is difficult to diagnose dinn 

20 cally as being non-A non-B hepatitis by such a method because there is no correlation between ALT value and 
non-A non-B hepatitis. Also, the lack of trustworthy means for the diagnosis is a serious problem, whereby a 
secondary infection with the non-A non-B hepatitis virus which may be caused by transfusing blood, especially, 
from a non-A non-B hepatitis virus-carrying healthy carrier into a person can hardly be prevented. Therefore, 
it is considered that the non^A non-B hepatitis occupies more than 90% of hepatitis cases caused by Wood 

25 transfusion, with a total of about one million patients per one year. 

In order to improve such situation and to raise a diagnostic accuracy of non-A non-B hepatitis, Alter's panel 
in which a standard serum is used has been developed by Alter at al at the NIH. Diagnostic materials which 
can pass the Alter's panel have been obtained by Arima ef a/ [JIKKEN IGAKU (Japan), 7 (2), 1 96 - 201 (1 989)] 
and by M. Houghton et al (WO 89/04669, PCT/JP90/500880) of Chiron Corp. almost simultaneously. Arima ef 

30 a! have screened the sera from hepatitis patients using Xgt11 (a protein expression vector) which is derived 
from viral RNA from a non-A non-B hapatitis patient's serum. Also, Chiron Corp. have inoculated the patient's 
blood plasma into a chimpanzee to develop a chronic hepatitis, blood plasma being obtained from the diseased 
animal which possesses the anti-virus antibodies with high titer, and then have screened in the same way as 
Arima ef al Chiron Corp.'s group has also succeed in cloning almost the whole portion of the gene of a hepatitis 

35 C virus (HCV, designated by Chiron Corp.) and developed a kit for diagnosis which comprises anantigen protein 
obtained by expressing a part of the HCV gene. 

In spite of such an effort, however, factors of this disease, even their numbers, have not yet been elucidated 
to the full. 

As described above, the two materials which can pass the Alter's panel has certainly lead to a new tech- 

40 nique of diagnosis replaced by said "diagnosis by exclusion*, but screening patient* s sera separately with the 
materials gives no results to be satisfied because both the materials from Arima ef al and Chiron Corp. react 
with patiehf s sera In low positive ratios of about 60 to 80% and about 50 to 70%, respectively. In other words, 
in some cases, these materials would not react with sera from the patients who have been diagnosed clinically 
as non-A non-B hepatitis. A virus commonly have a function to cause mutation in their host cells for their surval, 

45 and thus the viral genes isolated from American patients by Chiron Corp. had been possibly mutated into vari- 
ous forms acclimated to the chimpanzee as an infection intermediate. 

Accordingly, a great demand has been directed to a large scale preparation of the reactive antigens which 
are capable of probing thenoh-A non-B hepatitis patients or carriers, therefor it win be necessary to construct 
effective cDNA clones through the isolation and purification of variously mutated viral RNA from many non-A 

so non-B hepatitis patients. 

In addition, in the case of sera which have faOed in a trustworthy diagnosis using an antibody detection 
system, or of sera which are collected Immediately after infection and in which antibody titers do not yet raise, 
a gene amplification method (PCR method) may be useful for the confirmation of the disease because it can 
detect a trace amount of viral genes. Also, it is possible to clone the genes efficiently by the PCR method. How- 

55 ever, since the PCR method is carried out using primers which are synthesized from a known gene sequence, 
it is not always possible to detect a gene of the non-A non-B hepatitis virus in a patienf s fluid using a primer(s) 
which can be constructed on the basis of the HCV gene sequences determined by Chiron Corp., if a difference 
in mutation between said HCV gene of Chiron Corp. and said patient-carried viral gene is significant 
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In consequence, to detect efficiently infection with the non-A non-B hepatitis vims, it is necessary to prepare 
at least one primer capable of detecting the viral gene with a high specificity. Such a purpose may be accom- 
plished by isolating a great number of cDNA clones, synthesizing primers from relatively preserved regions 
among their gene sequences, and subjecting the primers obtained to screening through the PCR method. 

5 

SUMMARY OF THE INVENTION 

This invention provides a novel DNA fragment which encodes a non-A non-B hepatitis-specific antigen 
polypeptide originated from a non-structural or structural protein of the non-A non-B hepatitis virus, the polypep- 
w tide being formed at the time of the infection or onset of the non-A non-B hepatitis. 

This invention also provides an expression vector containing the DNA fragment, a transformant transfor- 
med with the expression vector, an expressed polypeptide obtained by culturing the transformant, and a pro- 
cess for its production. 

This invention further provides a primer for use in the detection of non-A non-B type hepatitis virus genes. 
15 This invention further yet provides use of the expressed polypeptide or single strand DNA primer in detec- 
tion of the non-A non-B hepatitis virus, and a method for the detection of non-A non-B type hepatitis virus genes 
and anti-non-A non-B type hepatitis virus antibodies. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 

Fig. 1 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C11-7 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ ID 
NO. 1. 

Fig. 2 shows a nucleotide sequence of non-A non-B hepatitis specific-cDNA which is encoded in a clone 
25 C10-1 1 and an amino acid sequence deduced from the sequence. These sequences are equvalent to SF.Q ID 
NO. 2. 

Fig. 3 shows a nucleotide sequence of non-A non-B hepatitis specific-cDNA which is encoded in a clone 
C10-13 and an amino acid sequence deduced from the sequence. These sequences are equvalent to SEQ ID 
NO. 3. 

30 Fig. 4 shows a nucleotide sequence of non-A non-B hepatitis specific-cDNA which is encoded in a clone 
C10-14 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 4. 

Figs. 5 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-15 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
35 ID NO. 5. 

Fig. 6 shows a nucleotide sequence of non-A non-B hepatitis specific-cDNA which is encoded in a clone 
C10-16 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 6. 

Fig. 7 shows a nucleotide sequence of non-A non-B hepatitis specfftc-cDNA which is encoded In a clone 
40 C 10-1 7 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 7. 

Figs. 8 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-18 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 8. 

45 Fig. 9 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-19 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 9. 

Figs. 1 0 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-21 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
60 ID NO. 10. 

Fig. 1 1 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a done 
C10-22 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 11. 

Fig. 12 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
55 C10-23 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 12. 

Fig. 13 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-35 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
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ID NO. 13. 

Fig. 14 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C11-C21 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 14. 

5 Fig. 1 5 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 

C10-E12 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 15. 

Fig. 1 6 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-E13 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
10 ID NO. 16. 

Fig. 17 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
C10-E24 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 17. 

Fig. 16 shows a nucleotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a clone 
15 C10-E15 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 18. 

Fig. 19 shows a flow sheet for the construction of an expression plasmid Trp TrpE C1 1-7. 

Fig. 20 shows a flow sheet for the construction of an expression plasmid TrpTrpBCI 1-C21 . 

Fig. 21 is a photograph showing the results of western blotting analysis of an expressed product. TrpEd 1- 
20 7, with serum from a normal person or non-A non-B hepatitis patient, wherein the antigens used are a purified 
antigen in A, an extract of expressed cells in B, and an extract of non-expressed cells in C. 

Fig. 22 is a photograph showing the results of western blotting of an expressed product, TrpEd 1-C21, 
with sera (A, B) from two normal persons or non-A non-B hepatitis patients. 

Fig. 23 is a graphical representation of the EUSA-determined positive numbers in Table 4. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

Many aspects and advantages of the present invention will be made apparent to those skilled in the art by 
the following detailed description about preferred embodiments of the invention. 
30 The present invention provides a specified DNA fragment comprising a base sequence which encodes a 
nbn-A non-B hepatitis-specific antigen polypeptide from a non-structural or structural protein of the non-A non-B 
type hepatitis virus. 

In preparation of the DNA fragment of the present invention, it is characterized that variously mutated genes 
of pathogenic viruses were directly collected from fresh blood plasma pools of a number of non-A non-B hepati- 

35 tis patients. More particularly, the preparation comprises the steps in which total RNA molecules including non- 
A non-B hepatitis virus RNA are isolated from the blood plasma pool, cDNAs are synthesized based on the 
isolated RNA molecules by the well-known random primer method, and then the cDNAs obtained are incorpo- 
rated into X phage to prepare a cDNA library. The cDNA library is subsequently immunoscreened using sera 
from a non-A non-B hepatitis patient to obtain the DNA fragments of interest Thereafter, using the resulting 

40 DNA fragments as probe, cDNA libraries obtained from the blood plasma from several chronic non-A non-B 
hepatitis patients were subjected to hybridization assay In order to isolate a cDNA which has different homology 
from the known counterparts and which is specific for the non-A non-B hepatitis patient 

Such a process makes ft possible to provide the viral antigens which are markedly useful for the diagnosis 
of non-A non-B hepatitis patients carrying the variously mutated viruses and for the improvement of detection 

45 accuracy of the hepatitis viruses contained in blood for transfusion which was collected from many latent car- 
riers carrying non-A non-B type hepatitis viruses. 

The following describes the present invention in detaB with regard to the preparation of cDNA library, iso- 
lation and sequencing of Defragments, expression and isolation of polypeptides, and their application to diag- 
nosis of non-A non-B hepatitis using enzyme-linked immunosorbent assay (EUSA) or PCR method. 

50 - 
Preparation of cDNA library 

Firstly, cell debris is removed from each of freshly collected blood plasma samples of several non-A non-B 
hepatitis patients by centrifugation and the resulting supernatant is again subjected to centrifugation at a higher 
55 speed of rotation to obtain a pellet The pellet is subjected to an equilibrium density gradient centrifugation using 
cesium trifluoroacetate to isolate total RNA as a precipitate, and the total RNA is purified by phenol/chloroform 
extraction and ethanol precipitation. 

By the method of Gubler and Hoffman using random primers, cDNA is synthesized from the above RNA 
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fraction. The cDNA is methylated by treating it with a DNA methyiase (for example, EcoRl methytase), connec- 
ted with a DNA tinker (for example, EcoRl linker) or DNA adapter (for example, EcoRl adapter), and then cloned 
into a cloning vector such as X phage (for example, Xgt10 or Xgt11) to prepare a cDNA library. 

5 Isolation and sequencing of DNA fragments 

Next, Escherichia coli is infected with the X phage cDNA library and cultured on an agar plate to form pla- 
ques. These plaques are transferred on a nitrocellulose filter, and subjected to blocking followed by immunosc- 
reening using a non-A non-B hepatitis serum in order to detect positive clones. Alternatively, to improve effi- 
10 ciency of the screening, each positive clone obtained is cloned into a cloning vector such as plasmid and a 
^P-labeled DNA probe is prepared by random primer technique, and then positive plaques are detected from 
the aforementioned cDNA library using the probe. 

Eighteen clones in total were obtained by the above procedure and designated as C1 1-7, C10-1 1, C10-13, 
' C10-14, C10-15, C10-16, C10-17, C10-18, C10-19, C10-21, C10-22, C10-23, C10-35, C11-C12, C10-E12, 
15 C1 0-E1 3, C1 0-E24 and C1 0-E1 5. 

A cDNA sample is obtained from X phage DNA of each 18 clones in a traditional manner and digested with 
appropriate restriction enzymes such as EcoRl and BamH\. Each cDNA fragment obtained is purified by agar- 
ose gel electrophoresis, incorporated into a sequencing vector (M 13 phage), and then subjected to the dideoxy 
chain termination method [Sanger et at Proc. Natl. Acad. Sc/„ USA, 74, 5463 (1977)] in order to determine a 
20 base sequence of each cDNA fragment. 

Nucleotide sequences of these clones and deduced amino acid sequences are shown in Figs. 1 to 18 and 
in a Sequence Listing which will be described later as SEQ ID NOs. 1 to 18. That is, the SEQ ID NOs. 1 to 18 
respectively represent the nucleotide and deduced amino acid sequences determined from clones C1 1-7, C1 0- 
11, C10-13, C10-14, C10-15, C10-16, C10-17, C10-18, C10-19, C10-21, C10-22, C10-23, C10-35, C11-C12, 
25 C10-E12, C10-E13, C10-E24 and C10-E15. Also, the base pair (BP) number of their DNA fragments is 763 
BP, 615 BP, 771 BP. 630 BP, 1426 BP, 855 BP, 315 BP, 911 BP, 489 BP, 1076 BP, 284 BP, 641 BP, 432 BP, 
369 BP, 932 BP, 559 BP, 276 BP and 742BP, respectively. 

All the 18 clones contained a continuous open reading frame but with no termination codon. 
Analysis of genomic RNA has revealed that hepatitis C virus (HCV) is a class of virus similar to the genus 
30 FlavMrus such as Japanese encephalitis virus [Protein, Nucleic Add and Enzyme (Japan), 35 (12), 21 1 7 - 2127 
(1 990)]. From the comparison of homology between the reported gene and polypeptide of FlavNims and those 
of the present invention, it was found that clones C1 1-C12, C10-E12, C10-E13, C10-E24 and C10-E15 encode 
a structural protein of the non-A non-B type hepatitis virus. More particuarly, done C11-C12 is a gene which 
encodes the core of non-A non-B hepatitis virus, and clones C10-E12, C10-E13, C10-E24 and C10-E15 are 
35 genes encoding a region from the latter half of the virus core to the env or a region downstream from the env. 
Other clones were found to be genes encoding non-structural proteins of the virus. 

The nucleotide sequences of the above 18 clones and the amino acid sequences translated along the open 
reading frames showed homologies with those of hepatitis C virus (HCV) reported by Houghton et a! (EP-A- 
318,216, 1988). In other words, clones C11-7, C10-16, C10-17, C10-18, C10-19, C10-21, C10-22 and C10-23 
40 showed relatively high homologies with HCV: 80 to 82% homology at nucleic acid level and 91 to 94% at amino 
acid level. In addition, these clones showed more higher homologies with the sequence J1 reported by Miyam- 
ura etaL {Nuc. Act. Res., 17, 10367 - 10372, 1989): 85 to 95% homology at nucleic acid level and 87 to 100% 
at amino acid level. These clones were classified as group 1 because of high homology in their overlapped 
portion. On the contrary, clones C10-11, C10-13, C10-14, C10-15 and C10-35 showed low homologies when 
45 compared to the nucleotide and amino acid sequences of HCV and J1, i.e., 69 to 70% homology at nucleic 
acid level and 75 to 80% at amino acid level, they were therefore classified as group 2. 

In addition, when the 369 BP nucleotide and deduced 123-amino acid sequences, indicated as SEQ ID 
NO. 14, for the CI 1-C21 done encoding a structural protein of the virus were compared with the portions over- 
lapped with HCV reported by Houghton et al (WO 90/1 1089), a nucleic add homology of 81.8% and an amino 
so acid homology of 87% were found. Also, when compared with HCV clones, HC-J1 and HC-J4, obtained from 
a Japanese patient (Okamoto et a/.; Japan J. Exp. Med., 60, 3, p. 167 - 177, 1990), homologies of 82.1% and 
82.7% at nucleic acid level and 87.6% and 89.4% at amino acid level were shown. Since the same regions 
among the reported three clones (HCV by Houghton et al. and HC-J1 and HC-J4 by Okamoto et al.) have high 
homologies of 92.1 to 97.6% at nucleic acid level and 95.5 to 96.7% at amino acid level, it has been found that 
55 the clone C1 1 -C21 obtained by the present inventors has a certain distance from the reported clones in terms 
of homology and therefore is a different group of viral gene therefrom. The remaining 4 clones, C1 0-E12, C10- 
E13 and C10-E15, showed homologies of 83 to 93% at nucleic acid level and 82 to 95% at amino acid level 
when compared with the HCV, HC-J1 and HC-J4, while C10-E24 showed around 63% of homology at nucleic 
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acid level and around 60% of homology at amino acid level. 

However, no homology was found either at nucleic acid level or amino acid level, when the DNA fragments 

of the present invention were compared with any DNA fragment encoding non-A non-B hepatitis antigens which 

have been disclosed in Jpanese patent Application Laying-Open (KOKAI) Nos. 89/2576 and 89/124387. 
5 Consequently, the clones C10-11, C10-13, C10-14, C10-15, C10-35, C11-C12, and C10-E24 have low 

homologies with the reported clones both at nucleic acid and amino acid levels. Other clones are also distiguish- 

able from the reported clones. 

Therefore, the present invention provides a DNA fragment comprising a base sequence which encodes a 

non-A non-B hepatitis-specific antigen polypeptide, said polypeptide consisting of the whole or a part of the 
10 amino acid sequence which is encoded along the open reading frame and represented by any one of the SEQ 

ID NOs. 1 to 18. 

Naturally, the base sequences according to the present invention include any other base sequence which 
comprises other codons corresponding to each amino acid. 

Among the aforementioned clones, C11-7,C10-11 f C10-13, C10-14, C10-15, C10-16, C10-17,C10-18and 

15 C1 0-1 9 were transformed into E. coli HB1 01 strain and deposited on July 6, 1 990 with Fermentation Research 
Institute, Agency of Industrial Science and Technology, 1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, 305, 
Japan, respectively as E. coli HB101/C11-7 (Accession Number FERM P-11589), E. coli HB101/C10-11 
(FERM P-11581), E. coli HB101/C1 0-13 (FERM P-11582), E. coli HB101/C10-14 (FERM P-11583), E. coli 
HB101/C10-15 (FERM P-11584), E. coli HB 101 /C10-1 6 (FERM P-11585), E coli HB101/C1 0-17 (FERM P- 

20 11586), E. coli HB101/C10-18 (FERM P-11587) and E. co// HB101/C10-19 (FERM P-11588). These deposi- 
tions were subsequently converted on June 13, 1991 to an international deposition under Budapest Treaty by 
the same depositary institution as an international depositary authority set forth in Budapest Treaty to be given 
the following new Accession Numbers: 

25 



30 



35 



E. coli HB101 


Acr.es <3 ion No 


Clone 


Cll-7 


3442 


Clone 


C10-11 


3434 


Clone 


C10-13 


3435 


Clone 


C10-14 


3436 


Clone 


C10-15 


3437 


Clone 


C10-16 


3438 


Clone 


C10-17 


3439 


Clone 


C10-18 


3440 


Clone 


C10-19 


3441 



40 



45 Also, clones C11-C21, C10-E12, C10-E13, C10-E24 and C10-E15 were transformed into E co// JM109 
strain and deposited on December 1 1 , 1 990 with Fermentation Research Institute, Agency of Industrial Science 
and Technology, the same address, respectively under Accession Numbers FERM P-1 1892, FERM P-1 1894, 
FERM P-1 1895, FERM P-t*896 and FERM P-1 1897. These depositions were also subsequently converted 
on June 17, 1991 fordone C11-C12and on June 13, 1991 for other clones to an International deposition under 

50 Budapest Treaty in the same way. The following new Accession Numbers were given: 
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Accession No. ( FERM BP-) 



Clone C11-C21 



3450 



5 



Clone C10-E12 



3444 



Clone C10-E13 



3445 



10 



Clone C10-E24 



3446 



Clone C10-E15 



3447 



As described in the foregoing, the DNA fragments according to the present invention are different from any 
15 other prior DNA fragment Though non-A non-B hepatitis virus is generally divided into two classes, namely 
groups 1 and 2, on the basis of the comparison of homology between the clones encoding a non-structural reag- 
ion of the hepatitis vims, there is a possibility of existing an intermediate group or even a third group because 
the virus is very susceptible to mutation in its host cells. It may be accordingly difficult to correctly diagnose all 
the non-A non-B hepatitis patients using an antigen protein prepared from only one kind of DNA fragment In 
20 order to overcome such a problem and to improve an efficiency of the diagnosis, it is necessary to establish 
such a useful process for the preparation of DNA that a number of effective clones can easily be obtained, and 
to use several types of clones in combination in diagnosis 

Expression of non-A non-B hepatitis specific antigen polypeptide 



The present invention also provides an expression vector which is constructed by introducing the above- 
mentioned DNA fragment into a cloning site downstream of a promoter gene in a vector. 

Any conventional vector may be used such as plasmid, phage or the like. An expression vector may be 
constructed by the well-known techniques in the art The following describes some processes for constructing 
30 the expression vectors of the invention. 

Construction of expression plasmid TrpTrpEC11-7: 

A flow sheet for the construction of the expression plasmid Tn>TrpE-C1 1 -7 is shown in Fig. 1 9. 

35 Firstly, a plasmid pUCd 1-7 DNA obtained by incorporating the clone C1 1-7 into pUC1 19 is digested with 
restriction enzymes BamHI and Seal, and the resulting BamHI - Seal fragment is isolated by agarose gel 
electrophoresis and then purified by a glass powder technique. Separately from this, an expression vector 
Trp-TrpE DNA Is digested with BamHI and Seal, treated with a bacterial alkaline phosphatase (BAP) t and then 
extracted with phenol. The aqueous layer obtained is subsequently subjected to ethanol precipitation to obtain 

40 a treated vector DNA. By connecting the vector DNA with the aforementioned C1 1-7 DNA fragment in the pre- 
sence of T4 DNA ligase, the expression plasmid Trp TrpE*C1 1-7 is obtained in which the DNA encoding the 
non-A non-B hepatitis-specific antigen is located downstream of a promoter so that transcription of the DNA 
can be controlled by the promoter. 

45 Construction of expression plasmid Trp TrpEd 1-C21 ; 

A flow sheet for the construction of the expression plasmid Trp-TrpE-d 1 -C21 is shown in Fig. 20. 
Firstly, a DNA fragmenTcbntaining a stop codon in Its 3' terminal is prepared from a plasmid pUC-C1 1-C21 
DNA which Is obtained by incorporating the C1 1-C21 clone into pUC1 1 9 f by a gene amplification method (PCR) 

60 using two primers (5'-TTACGAATTCATGGGCACGAATCCT-3' and S'-TTAATCGATGACCTTACCCA- 
CATTGCG-3'). By ligating the thus-prepared DNA fragment with pUC1 18 which is predigested with Smal, a 
plasmid pUC118C11-C21Sma is obtained. This plasmid is then digested with EcoRI and BamHI, and the 
resulting DNA fragment is isolated by agarose gel electrophoresis and then purified by glass powder technique. 
Separately from this, an expression vector TrpTrpE DNA (Japanese Patent Application No. 90/180889) is 

55 digested with BamHI and EcoRI, treated with a bacterial alkaline phosphatase (BAP), and then extracted with 
phenol. The aqueous layer obtained is subsequently subjected to ehanol precipitation to obtain a treated vector 
DNA. By connecting the vector DNA with the aforementioned C1 1 -C21 DNA fragment by the action of T4 DNA 
ligase in a ligation buffer solution, the expression plasmid Trp TrpE C11-C21 is obtained in which the DNA-en- 
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coded polypeptide from a structural protein of the non-A norvB hepatitis virus is located downstream of a pro- 
moter so that transcription of the DNA can be controlled by the promoter. 

Other clones can also be made into corresponding expression plasmids by treating each clone with approp- 
riate restriction enzymes and introducing the treated fragment into an expression vector. 
5 When a procaryote is used as the host cell, a promoter eligible for use in the present invention may be 

selected from promoters originated from E. co//, phage and the like, such as tryptophan synthase operon (trp), 
lactose operon (lac), X phage P L , X phage P R and the like. When an eucaryote such as yeast is used as the 
host cell, promoters for 3-phosphoglycerate kinase and other glycolysis-related enzymes (Holland et a/; 
Biochemistry, 17: 4900, 1978) may be useful. Though not always required, a transcription termination factor 
10 may preferably be located in the expression vector. 

The vector may further contain a marker sequence, such as an ampicillin or a tetracycline resistance gene, 
which makes it possible to effect a phenotype selection in transformed cells. 

The present invention also provides a transformant which is obtained by introducing the expression vector 
of the invention into a host cell. Microorganisms used commonly in this field, such as £. coft B. subtilis, a yeast 
15 strain and the like, may be used as a host cell. 

Transformation may be effected by any usually used means for the incorporation of an expression vector 
into host cells. When a bacterium (for example, £ cob) is used as host cell, a direct incorporation technique 
with the use of calcium chloride (Mandel, M. and Higa, A; J. Mot. Bio., 53, 159 - 1 62, 1970) may be employed. 

In addition, the polypeptide of the present invention may be produced by inoculating and culturing a suitable 
20 host cell carrying the expression vector in an appropriate medium such as ampicillirvcontaining 2YT medium 
and then propagating expression cells by subculturing them in an ampicillin-containing phosphate medium. 

Production and purification of recombinant non-A non-B hepatitis-specific antigen polypeptide 

25 The present invention also provides a process for producing a non-A non-B hepatitis-specific antigen 
polypeptide, which comprises the following steps of: 

constructing a replicable expression vector which can express the aforementioned DNA fragment of the 
present invention in an appropriate host cell; 

obtaining a transformant by incorporating said expression vector into the host cell; 
30 producing a recombinant polypeptide by culturing said transformant under such conditions that said DNA 

fragment can be expressed; and 

recovering said recombinant polypeptide. 
The crude polypeptide product from host cells may be purified by disintegration of the host cells, for example 
by ultrasonic disintegration, subjecting the disintegrated cells to centrifugation to obtain an insoluble fraction 
35 containing a fused polypeptide between TrpE as signal peptide and a polypeptide encoded by cDNA syn- 
thesized from a non-A non-B hepatitis virus RNA, extracting the fused polypeptide in a soluble form with a urea- 
containing buffer, and then purifying the extracted polypeptide by subjecting it to an ion exchange column 
chromatography (S-Sepharose, for example). 

Accordingly, the present invention also provides a recombinant non-A non-B hepatitis-specific antigen 
40 polypeptide obtained by such a expression process, said polypeptide consisting of the whole or a part of the 
amino acid sequence represented by any one of the SEQ ID NOs. 1 to 18. 

The term "recombinant non-A non-B hepatitis-specific antigen polypeptide" as used herein is intended to 
Include a polypeptide itself which is obtained by expressing In a vector a DNA fragment encoding a non-A non-B 
hepatitis-specific antigen polypeptide, and a fused polypeptide obtained by fusing said polypeptide with other 
45 peptide such as a signal peptide. 

Application to diagnosis of non-A non-B hepatitis 

The expressed polypeptide of the present invention was subjected to SDS-polyacrylamide gel 
so electrophoresis and then allowed to perform antigen-antibody reaction with each two serum samples from nor- 
mal persons or non-A non-B hepatitis patients by means of western blotting, whereby this polypeptide reacted 
strongly with only the patient's sera as shown in Figs. 21 and 22. It was confirmed therefore that the expressed 
polypeptide functions as a non-A non-B hepatitis-specific antigen. 

Accordingly, the present invention also provides a method for immunological detection to detect an anti- 
55 body directed against the non-A non-B hepatitis virus antigen .which comprises the following steps of: 

incubating a sample possibly containing an anti-non-A non-B hepatitis virus antibody together with at 
least one recombinant non-A non-B hepatitis-specific antigen polypeptide of the present invention under such 
conditions that the antigen is capable of reacting immunologically with the antibody; and 
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detecting an antigen-antibody complex. 
Diagnostic effects (positiveness) of the expressed polypeptide TrpEd 1-C21 obtained by expressing the 
expression plasmid TrpTrpEd 1-C21 , another expressed polypeptide TrpEd 1-7 obtained by expressing the 
corresponding expression plasmid Trp TrpE C1 1-7, and an assay kit of Chiron Corp. (ORTHO HCV Ab ELISA 

5 kit) were examined by the conventional enzyme immunoassay through the reaction of the above expressed 
antigens with a serum sample from a patient who has been diagnosed clinically as being non-A non-B hepatitis. 
As the results, positiveness of the kit of Chiron Corp. was found to be 69,7% (23/33 cases) while the TrpE*C1 1 -7 
which belongs to group 1 showed a positiveness of 78.8% (26/33 cases). In the case of the expressed polypep- 
tide TrpE CI 1-C21 , it showed a positiveness of 84.8% (28/33 cases) which is higher than the case of the Chi- 

10 ron's kit. When the expressed polypeptide TrpEd 1-7 as a member of group 1 and the TrpE C11-C21 as a 
member of group 2 were used in combination, the positiveness increased to 93.9% (30/31 cases; see Table 1 
and Fig. 23). 

Therefore, according to an embodiment of the present invention, there is provided a combination of the 
group 1 and group 2-relating expressed polypeptides as a hepatitis-specific antigen polypeptide for use in the 
15 immunological detection. 

The present invention further provides a method for gene amplification which comprises amplifying a non-A 
non-B hepatitis virus gene using sense and/or antisense sequence synthesized on the basis of the DNA sequ- 
ences of the present invention. 

As the synthetic base sequence for PCR primer, the following single strand DNA sequences may be em- 
20 ployed: 

5'-GGATACACCGGTGACTTTGA-3'(sense, SEQ ID NO. 19); 
5'-TGCATGCACGTGGCGATGTA-3'(antisense, SEQ ID NO. 20); 
5'-GATGCCCACTTCCTCTCCCA-3'(sense, SEQ ID NO. 21); and 
5'-GTCAGGGTAACCTCGTTGGT-3'(antisense, SEQ ID NO. 22). 
25 said sequences being sense or antisense of the partial base sequence represented by the SEQ ID NO. 5 for 
the former two primers and by the SEQ ID NOs. 2, 4, 5 or 13 for the latter two primers. These specified primers 
are also within a scope of the invention. 

The single strand DNA sequences may be synthesized by the usual methods such as phosphorous acid 
method, phosphotriester method, solid phase method and the like, though the use of a DNA synthesizer is most 
30 convenient 

When used as a PCR primer, the above single strand DNA sequences show higher specificity for the group 
2 virus genes than for the group 1 virus genes (see Tables 2 and 3). 

Therefore, the present invention also provides a method for detecting the genes from the norvA non-B type 
hepatitis virus in a fluid sample such as serum, which comprises the following steps of: 
35 isolating RNA from the sample, 

synthesizing cDNA by treating the obtained RNA with a reverse transcriptase, 
subjecting the obtained cDNA to polymerase chain reaction using at least one the above-mentioned pri- 
mer; 

detecting an amplified non-A non-B type hepatitis virus gene. 
40 The present invention further provides use of the expressed polypeptides or single strand DNA sequences 
for PCR primer of the present invention in the detection of the nor>-A non-B hepatitis virus. 

The following examples are given to further illustrate the present invention in detail, but it is not intended 
to limit the invention thereby. 

45 Example 1 

Preparation of cDNA library from blood plasma of non-A non-B hepatitis patient 

A cDNA library was prepared using XgtIO and Xgtl 1 phages after preparing an RNA fraction in the following 
so manner from fresh blood plasma pools obtained from several Japanese patients of chronic stage non-A non-B 
hepatitis. 

Five liter of blood plasma was diluted with the equal volume of 50 mM Tris-HCI (pH 8.0) containing 1 mM 
EDTA, cell debris in the dfluted sample was removed by centrifugation at 3,500 g for 20 minutes and then the 
resulting supernatant was again subjected to centrifugation at 45,000 rpm (about 100,000 g) for 4 hours at a 
55 temperature of 4°C to obtain pellet The pellet was dissolved, according to the conventional procedure, in 6M 
guanidium thiocyanate as a protein denaturating agent, layered over a solution of cesium trifluoroacetate, and 
then subjected to centrifugation using Beckman SW50 rotor at 33,000 rpm for 18 hours at a temperature of 
20°C. The resulting pellet was dissolved in 1 0 mM Tris-HCI (pH 7.5) containing 1 mM EDTA and extracted twice 
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with a solvent system of 1:1 phenolxhloroform, afterwhich the organic layer was mixed with 1/10 volume of 5 
M NaCI and 2.5 volumes of ethanol. After standing the mixture for 2 hours at -20°C, it was centrifuged at 15,000 
g for 20 minutes and the pellet was then dissolved in diethylpyrocarbonate-treated water to use as an RNA 
sample. 

5 In accordance with the method of Gubler and Hoffman, cDNA was synthesized from the thus obtained RNA 

sample by means of random primer technique using a commercially available kit (from Amersham or BRL). The 
cDNA was subsequently treated with EcoRI methyiase, ligated with an EcoRI linker or an EcoRI adapter and 
then cloned into the EcoRI site of Xgt10 and Xgt 11 phages. The cDNA library thus prepared contained 10 6 to 
10 7 PFU of recombinant phages in average. 

10 

Example 2 

Isolation of non-A non-B hepatitis-specific cDNA 

15 An attempt was made to isolate cDNA specific for non-A non-B hepatitis from the cDNA library prepared 
in Example 1, by immunoscreening and hybridization assay. 

Firstly, immunoscreening of Xgt1 1 library was carried out using two serum samples from non-A non-B 
hepatitis patients which are negative for HBc and HBs antibodies and which contain antibodies specific for the 
hepatitis-causing virus. Immunoscreening was performed in the usual way by examining specific reaction of a 

20 0-galactosidase-fused recombinant peptide with a serum sample of non-A non-B hepatitis (to be referred to as 
"NANBH" herein after) patient. 

Cells of E. co// Y 1090 strain were mixed with Xgt11 cDNA library at a predetermined ratio, plated on an 
agar medium at an appropriate density, and then incubated at 43°C for 3 hours to form plaques. Next, the agar 
plate was covered with a Hybond-C nitrocellulose filter which has been soaked with 10 mM IPTG and the fil- 

25 ter-covered plate was incubated again at 37°C for 3 hours to induce expression. Subsequently, the nitrocellul- 
ose filter was subjected to blocking using 3% gelatin solution, reacted with a serum sample of NANBH patient 
overnight at 4°C, and then, after washing, reacted with a peroxidase-fabeled anti-human IgG (goat antibody). 
A positive signal was found when the resulting fitter was reacted with a mixture of diaminobenzidine and H 2 0 2 . 
This clone, C1 1-7, did not react with HBc and HBs antibodies. 

30 Next, in order to improve efficiency of the screening, the clone C1 1-7 was re-cloned into pUC1 1 9 and made 
into a probe by random primer method. Using the probe, Xgt10 cDNA library was screened by means of hyb- 
ridization assay. Screening was earned out according to the conventional method by plating 5x10* PFU of 
recombinant phages with E. coli G-600 hfl(-) on an L-plate (150 mm dish). When plaques appeared after over- 
night incubation of the plate at 37°C, the plate was stored at 4°C for 1 hour and thereafter the plate was covered 

35 with a Hybond-N filter for a period of 30 seconds. The resulting filter was superposed for 1 minute on a filter 
prewetted with a denaturating solution (0.5 M NaOH and 1.5 M NaCI), soaked for 5 minutes in a neutralizing 
solution (0.5 M Tris-HCI pH 7.0 and 1 .5 M NaCI), washed with 2 x SSC, and then dried. The filter was subjected 
to UV-crossI inking by exposing it to UV rays (304 nm) for 2 minutes. Thereafter, as described below, the result- 
ing filter was subjected to screening by hybridization assay using a ^-labeled DNA probe which has been 

40 prepared by random primer method from the C11-7 clone obtained by immunoscreening with a serum from 
NANBH patient 

The filter was incubated overnight at 65°C in 1 x SSC, washed twice with 1 x SSC at 65°C (10 minutes for 
each) and then subjected to autoradiography at -70°C for the detection of positive plaques. Each positive plaque 
was transferred into SM buffer and used as a phage stock. Clones obtained were used as marker probe to carry 
45 out a series of screening. As the results, 13 clones in total were isolated and designated as C10-11, C 10-13, 
C10-14, C10-15, C10-16, C10-17, C10-18, C10-19, C10-21, C10-22, C10-23 and C10-35. 

Example 3 ^ 

so Selection isolation of group 2 non-A non-B hepatitis-specific cDNA 

A blood plasma sample which can react only with C10-14 clone was obtained by subjecting fresh blood 
plasma of a Japanese patient in a chronic phase of the non-A non-B hepatitis to an ELISA-based screening 
system, using expressed products of the group 1 cDNA clone C1 1-7 and the group 2 cDNA clone C10-14 iso- 
55 lated in Examples 1 and 2. This blood plasma sample was subjected to a gene amplification method (PCR 
method) using well preserved primers of group 1 and those of group 2. PCR method was carried out using Gene 
Amp™ (DNA Amplification Reagent Kit, Perkin Elmer Cetus) under conditions of: DNA denaturation, 95°C for 
1.5 minutes; annealing, 55°C for 2 minutes; and DNA synthesis, 70°C for 3 minutes. Blood plasma samples in 
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which gene amplification was found only with the use of the group 2 primers under these conditions were pooled 
for further use. An RNA fraction was prepared from one liter of this fresh blood plasma sample in the same 
manner as in Example 1 , and a cDNA library (referred to as "cDNA library A" hereinafter) was constructed using 
Xgt10 and Xgt1 1 phages. The cDNA library A contained 10 6 to 10 7 PFU of recombinant phages in average. 

5 On the other hand, a cDNA library B was constructed using Xgt10 phage from five liters of fresh blood 

plasma samples which have been collected as starting material from several patients of non-A non-6 hepatitis 
and have not been subjected to the ELISA/PCR method, in the same manner as described above. The cDIMA 
library B also contained 10 8 to 10 7 PFU of recombinant phages in average. 

Cloning of non-A non-B hepatitis-specific cDNAfrom cDNA library A was carried out by immunoscreening 

10 in the same manner as in Example 2, and a positive plaque (clone C1 1 -C21 ) was obtained. The clone C1 1-C21 
showed no positive reaction with HBc and HBs antibodies. 

In order to improve efficiency of the screening, the thus obtained clone C11-C21 was re-cloned into 
pUC1 1 9, digested with restriction enzymes, and then made into a 32 P-labeled probe by random primer labeling 
method in the same manner as in Example 2. Using the probe obtained, the cDNA library B was screened by 

is hybridization assay. After a series of the screening efforts, 4 clones were isolated and named C10-E12, C10- 
E13,C10-E24andC10-E15. 

Example 4 

20 Sequencing of non-A non-B hepatitis-specific cDNA 

E coii cells were infected with the Xgt1 1 or Xgtl 0 phage of each of the 1 8 clones obtained in Examples 2 
and 3 to recover respective phage in a large quantity. DNA was extracted from the phage by the conventional 
alkali method, digested with a restriction enzyme EcoRI, BamH\ or Kpn\, and the resulting DNA fragments were 

25 purified by agarose gel electrophoresis. Separately from this, sequencing vectors mp18 and mp19 of M13 
phage (Messing,J.; Methods in Enzymology, 101_, 20 - 78) or pUC118 and pUC119 (VieiraJ. and Messing.J.; 
Methods in Enzymology, 153, 3-11) were digested with a restriction enzyme EcoRI, BamHI or Kpn\ to obtain 
linear vector fragments. The cDNA fragment and the vector DNA were linked together using T4 ligase in a buffer 
solution, and the resulting reaction product was incorporated into E. coii HB101 or JM109 strain by transfor- 

30 mation or transfection. Resulting E coii ceils were cultured and DNA was recovered by alkali method. Nuc- 
leotide sequence of the DNA obtained was determined according to the dideoxy chain termination method of 
Sanger er a/. 

The nucleotide sequences of clones C10-11, C10-13, C10-14, C10-15, C10-16, C10-17, C10-18, C10-19, 
C10-21, C10-22, C10-23, C1(W5, C10-C21, C10-E12, C10-E13, C10-E24 and C10-E15 and the amino acid 
35 sequences deduced from these nucleotide sequences are shown in a sequence table as SEQ ID NOs. 1 to 18 
and also in Figs. 1 to 18. 

On the basis of the comparison of homologies among these sequences and the nucleotide and deduced 
amino acid sequences disclosed by Houghton ef a/. (WO 89/04669, PCT/JP90/500880) and Miyamura etal(Nuc. 
Ad. Res., 17, 10367-10372(1989)), clones C11-7, C10-17, C10-18, C10-19, C10-21, C10-22 and C10-23 

40 obtained in Example 2 were classified as group 1 done as defined hereabove while clones C10-11, C10-13, 
C1 0-1 4, C1 0-1 5 and C1 0-35 were classified as group 2 clones. Evry of these 1 3 clones encoded non-structural 
protein of the non-A non-B type hepatitis virus. Moreover, clone C1 0-C21 in Example 3 was classified as group 
2 from the comparison of homology with the sequences described by Houghton et al (WO90/11089) and 
Okamoto et e\(Japan J. Exp.Med., 60, 3, pp.1 67-1 77 (1990)), but classification of the clones C1 0-E12, C10-E1 3, 

45 C10-E24 and C10-E15 in Example 3 is not still dear. However, it was found that these 5 dones encode the 
structural protein of non-A non-B hepatitis virus from the comparison of homology with the reported genome 
of Flavivinis (Protein, Nudelc Add and Enzyme (Japan), 35 (12), 2117-2127 (1990)). 

Example 5 

60 - 

Expression and purification of polypeptide encoded by non-A non-B type hepatitis virus cDNA 

(i) Construction of expression plasmid Trp TrpE C1 1-7: 

55 One of the dones isolated, C11-7, was expressed as a fused polypeptide with TrpE in E. coii under the 
control of Trp promoter(see Fig. 19). 

Firstly, 1 ng of a plasmid pUC-C1 1-7 DNA which has been obtained by incorporating the C1 1-7 done into 
pUC1 1 9 was digested by incubating it at 37°C for 1 hour in 20 uJ of a restriction enzyme reaction solution [1 50 
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mM NaCI, 6 mM Tris-HCt (pH 7.9), 6 mM MgCJ 2 , 1 5 units of BamHI enzyme and 1 5 units of Seal enzyme]. There- 
after, a BamH\-Scs\ fragment of about 700 bp was obtained by subjecting the resulting reaction solution to 0.8% 
agarose gel electrophoresis, and the fragment was purified by glass powder method (Gene Clean™, Bto-101). 
One \ig of TrpTrpE DNA which is an expression vector was digested by incubating it at 37°C for 1 hour in 

5 20 uJ of a reaction solution [150 mM NaCI, 6 mM Tris-HCI (pH 7.5), 6 mM MgCI 2 , 15 units of BamHI enzyme 
and 15 units of Seal enzyme]. After adding 39 (J of water, the resulting reaction solution was heat-treated at 
70°C for 5 minutes, mixed with 1 yd (250 U/^) of a bacterial alkaline phosphatase (BAP) and then incubated 
at 37°C for 1 hour. The reaction solution was subsequently extracted with phenol, the aqueou layer was sub- 
jected to ethanol precipitation followed by drying of the precipitate. One u-g of the BamHI-Scal-treated vector 

10 DNA obtained and the above C1 1-7 DNA fragment was added to 5 yd of 10 x ligase buffer [660 mM Tris-HCI 
(pH 7.5), 66 mM MgCI* 100 mM dithiothreitol and 1 mM ATP] and 1 ^l of T4 DNA ligase (350 U/yd), and water 
was then added to the mixture to 50 uJ of the final volume. Thereafter, the thus prepared mixture was incubated 
overnight at 16°C to complete ligation. 

E. coli HB101 strain was transformed with 10 ui of the resulting reaction solution. Competent E. coli strain 

15 for use in the transformation was prepared by calcium chloride technique [Mandel.M. and Higa A; J- Mol. Biol., 
53, 159 - 162 (1970)]. The transformed E. coli strain cells were spread on an LB-plate (1% trypton, 0.5% yeast 
extracts, 0.5% NaCI and 1.5% agar) containing 25 ag/ml of ampicillin and incubated overnight at 37°C. One 
loopfut of each colony grown on the plate was transferred into a liquid LB medium containing 25 itg/ml of ampicil- 
lin and cultured overnight at 37°C. Cells in 1.5 ml of the cultured medium were collected by centrifugation, and 

20 Miniprep of plasmid DNA was carried out by alkali method (Maniatis etal; Molecular Cloning: A Laboratory Man- 
ual, 1982). One u.g of the plasmid DNA obtained was digested at 37°C for 1 hour in 20 uJ of a reaction solution 
[150 mM NaCI, 6 mM Tris-HCI (pH 7.5), 6 mM MgCI 2 , 15 units of BamHI and 15 units of Seal]. Thereafter, the 
digested solution was subjected to agarose gel electrophoresis to obtain an expression plasmid TrpTrpEd 1 -7 
which can produce the 700 bp BamHI-Scal fragment This plasmid was transformed into E. coli HB101 strain 

25 and deposited on July 6, 1990 with Fermentation Research Institute, 1-3 Higashi 1-chome, Tsukuba-shi, Iba- 
raki-ken, 305, Japan, under the Accession Number FERM P-11590 (named E. coli HB101/Trp TrpE C11-7). 
This deposition was subsequently converted on June 13, 1991 to an international deposition under Budapest 
Treaty by the same depositary institution as an international depositary authority set forth in Budapest Treaty 
to be given the new Accession Number FERM BP-3443. 

30 

(ii) Expression and purification of polypeptide encoded by clone C1 1-7 : 

E. coli HB101 strain transformed with the expression plasmid Trp*TrpE*C11-7 was inoculated into 3 ml of 
a liquid 2YT medium (1.6% trypton, 1% yeast extracts and 0.5% NaCI) containing 50 u.g/ml of ampicillin and 

35 cultured at 37°C for 9 hours. One ml portion of the cultured broth was inoculated into 100 ml of a liquid M9-CA 
medium (0.6% Na 2 HP0 4 , 0.5% KH2PO4, 0.5% NaCI, 0.1% NH4CI, 0.1 mM Cad* 2 mM MgS0 4 , 0.5% casamino 
acid and 0.2% glucose) containing 50 jigAnl of ampicillin and cultured at 37°C for 21 hours. A 18-mJ portion of 
the resulting culture broth was then inoculated into 1.2 1 of the M9-CA medium and cultured at 37°C. When 
turbidity at ODeoo of the culture broth reached 0.3, Indole acrylate was added to a final concentration of 40 mg/l, 

40 and the culturing was continued for additional 1 6 hours. Cells collected from the final culture broth by centrifu- 
gation were suspended in 20 ml of buffer A [50 mM Tris-HCI (pH 8.0), 1 mM EDTA and 30 mM Nad] and the 
cell suspension was again subjected to centrifugation to obtain 2.6 g of expressed cells. The thus obtained cells 
were suspended In 10 ml of the buffer A, disintegrated by ultrasonic treatment, and then subjected to centrifu- 
gation to obtain an insoluble fraction containing a fused polypeptide of TrpE with a polypeptide which is encoded 

45 by the nort-A non-B type hepatitis virus cDIMA. The fused polypeptide in the insoluble fraction was solubflized 
and extracted using 10 ml of the buffer A containing 9 M urea. Thereafter, the solubilized extract was subjected 
to an S-Sepharose ion exchange column chromatography with an Nad gradient of from 0 M to 0.5 M to purify 
the fused polypeptide 

- r 

so (Hi) Construction of expression plasmid TrpTrpE-C11-C21 : 

The clone C1 1-C21 was expressed as a fused polypeptide with TrpE in E. coli under the control of a pro- 
moter (see Fig. 20). 

Firstly, 1 ng of plasmid pUC-C1 1-C21 DNA which has been obtained by incorporating C11-C21 clone into 
55 pUC1 1 9 was subjected to PCR method using two primers (5'-TTACGAATTCATGGGCACGAATCCT-3' and 5'- 
TTAATC GATGACCTT ACCCACATTGCG-3'). PCR method was carried out using Gene Amp™ kit (DNA Ampli- 
fication Reagent Kit, Perkin Elmer Cetus) under reaction conditions of: DNA denaturation, 95°C for 1 .5 minutes; 
annealing, 50°C for 2 minutes; and DNA synthesis, 70°C for 3 minutes. DNA fragments thus obtained were 



12 




EP 0 468 657 A2 

separated by 0.8% agarose gel electrophoresis and purified by glass powder technique. Separately from this, 
pUC118 was digested with a restriction enzyme Sroal and then ligated with the DNA fragment obtained by PCR 
method in a buffer solution containing T4 ligase to obtain a plasmid pUC1 18 C1 1-C21 Sma. One ug of the plas- 
mid DNA obtained was digested at 37°C for 1 hour in a restriction enzyme reaction solution [150 mM NaCI, 6 

5 mM Tris-HCI (pH 7.9), 6 mM Mga 2 , 15 units of EcoRI enzyme and 15 units of SamHI enzyme]. Thereafter, the 
resulting reaction mixture was subjected to 0.8% agarose gel electrophoresis to isolate an EcoRI-SamHI frag- 
ment of about 380 bp which was then purified by glass powder technique (Gene Clean™, Bio-101). 

Next, ligation and transformation were carried out substantially in the same manner as in the aforemen- 
tioned procedure (i) except that restriction digestion of the expression vector Trp-TrpE DNA was carried out 

w using EcoRI and SamHI instead of BamHI and Seal. Thereafter, an expression plasmid TrpTrpEC11-C21 
which can produce the EcoRI-BamHI fragment of about 380 bp was selected by agarose gel electrophoresis 
purification. This plasmid was transformed intoE. co//HB101 strain and deposited on December 1 1 , 1990 with 
Fermentation Research institute, Agency of Industrial Science and Technology, the same address, under the 
Accession Number FERM P-1 1893 (named E. coli HB101/TrpTrpEC1 1-C21). The deposition was also sub- 

15 sequentiy converted on June 17, 1991 to an international deposition under Budapest Treaty by the same 
depositary institution as an international depositary authority set forth in Budapest Treaty to be given the Acces- 
sion Number FERM BP-3451. 

(iv) Expression and purification of polypeptide encoded by clone C11-C21: 

20 

Expression and purification of a fused polypeptide were carried out substantially in the same manner as 
in the aforementioned procedure (ii), except that the expression plasmid TrpTrpEd 1-C21 obtained by the 
above procedure (iii) was used instead of Trp*TrpE C1 1-7. 

25 Example 6 

Measurement of anti-non-A non-B type hepatitis virus antibody in serum from non-A non-B hepatatis patient 
(i) Measurement by western blotting: 

30 

The expressed product obtained and purified in Example 5 was subjected in turn to SDS-polyacrylamide 
gel electrophoresis [Laemmf i; Nature, 277. 680 (1 970)] and to blotting on a nitrocellulose filter (Bio-Rad, Trans- 
blot) in usual way. The filter was blocked with a 3% gelatin solution and then reacted with each serum samples 
from normal persons or non-A non-B hepatitis patients. After washing, the resulting filter was reacted with a 
35 peroxidase-labeled human IgG (goat antibody). Thereafter, the filter was washed again and soaked in a solution 
containing diaminobenzidine as reaction substrate to confirm color development 

The results are shown in Figs. 21 and 22. In Fig. 21, the expressed polypeptide TrpEd1-7 (group 1) 
obtained In Example 5-{fi) was used as antigen, and in Fig. 22, the expressed polypeptide TrpE-C1 1-C21 (group 
2) in Example 5-(iv) was used. In each case, no reaction was observed with a normal serum sample, but a strong 
40 reaction with a patient's serum sample was found with a specific band. 

(ip Measurement by enzyme-linked immunosorbent assay (EUSA) : 

EUSA can be used as a means to make diagnosis of a large number of serum samples as compared to 

45 the case of western blotting method. EUSA was carried out as follows: 

A purified antigen sample was dSuted with PBS(-) to a concentration of 5 ug/ml and fixed to a micro-plate 
at 4°C or room temperature. After washing several times with a washing solution, a diluted serum sample to 
be detected was added to tfie resulting plate and incubated for 1 hour at 37°C or room temperature. After wash- 
ing, peroxidase-labeled anti-human IgG (goat antibody) was added and incubated at 37°C or room temperature 

so to complete the reaction. After washing several times, 50 uJ of a diaminobenzidine solution was added and incu- 
bated at 37°C to develop color. Thereafter, the coloring reaction was stopped with 2 M H2SO4 and the color 
was measured by a colorimeter. 

Positive ratios in the case of the use of the expressed polypeptide antigens, TrpE-C11-7 (group 1) and 
TrpE-d 1-C21 (group 2), of the present invention were compared with the case of the use of a commercially 

55 available kit of Chiron Corp. (Ortho HCV Ab ELISA Test). As shown in Table 1 , the use of the Chiron's kit resulted 
in 69.7% of the positive ratio, while positive ratios in the case of the use of the TrpE C1 1-7 and TrpE C1 1-C21 
were 78.8% and 84.8%, respectively Moreover, the positive ratio increased to 98.9% (30 of 31 cases) when 
these two expressed polypeptides of the present invention were used in combination (see Fig. 23). 
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Example 7 

Detection of non-A non-B type hepatitis virus group 2 gene in blood plasma from non-A non-B hepatitis 
patient by RT-PCR 

5 

RT-PCR was carried out as follows: 

To 100 uJ of a blood plasma sample collected from a non-A non-B hepatitis patient was added 300 ^ of a 
6 M GTC solution (6 M guanidine thiocyanate, 25 mM sodium citrate, 0.5% sarcosyl and 0.2 M 2-mercap- 
toethanol), and the mixture was stirred. To this were further added 40 pJ of 2 M sodium acetate (pH 5.2), 400 

10 uJ of phenol and 80 uJ of chloroform/isoamyl alcohol (49:1), and then thoroughly stirred. Aqueous solution layer 
separated from the mixture was mixed with isopropyl alcohol and then subjected to centrifugation. Synthesis 
of cDNA was carried out using the pellet as a source of RNA. For the cDNA synthesis, an RNase inhibitor and 
a reverse transcriptase were added to a reaction solution containing 1 0 mM Tris-HCI, 0.01 % gelatin, 1 mM each 
dNTP, 4 mM MgCI 2 , 1 mM DTT and 100 pmole each primer, and the mixture was incubated at 37°C for 2 hours 

is to complete the reaction. Then, PCR was carried out using the cDNA obtained. In order to increase sensitivity 
and specificity for the detection of bands, a two step PCR method was employed, that is, first PCR using two 
primers (1st step PCR) and subsequent PCR using two primers which exist inside the first PCR product (2nd 
step PCR). For the PCR reaction, each amplification cycle was carried out using 100 jd of a reaction solution 
containing cDNA, 10 mM Tris-HCI, 0.01% gelatin, 2 mM each dNTP, 1.5 mM MgCl 2 and 50 pmol each primer, 

20 under reaction conditions of: denaturation, 94°Cfor 1 .5 minutes; annealing, 50°C for 2 minutes; and chain elon- 
gation, 70°C for 2 minutes. The amplification cycle was repeated 35 times. Effects of several primers were 
evaluated. As the results, it was found that the group 2-specrfic DNA fragments are capable of being detected 
by the use of the following 4 primers: 
1st step PCR 

25 kk21: 5 # -GGATACACCGGTGACTTTGA-3' 

kk22: 5'-TGCATGCACGTGGCGATGTA-3' 
2nd step PCR 

kk26: 5'-GATGCCCACTTCCTCTCCCA-3' 
kk27: 5'-GTCAGGGTAACCTCGTTGGT-3' 
30 By applying these 4 primers to the PCR method, a DNA fragment of 206 bp can be detected. As a control, 
primers were synthesized from the base sequence of J1 and detection of group 1 DNA fragments was 
attempted. Results of the PCR from blood plasma samples of non-A non-B hepatitis patient are shown in Table 
2. 

It was known that DNA fragments from the non-A non-B hepatitis virus can be detected by both the PCRs 
35 using group 1 primers (i.e., group 1 PCR) and group 2 primers (i.e., group 2 PCR), and therefor two samples, 
Nos. 3 and 5, which are considered to include both groups 1- and 2-relating viruses were sequenced for their 
viral genes. As shown in Table 3, when nucleotide sequences of DNA fragments obtained by group 2 PCR were 
compared with C1 0-1 3 which is a group 2 clone, homologies of 85% and 88% were observed, indicating effec- 
tive detection of group 2 genes. When these two nucleotide sequences were compared with the aforementioned 
40 group 1 done J 1 (Miyamuraetal, supra), only 64.8% and 68% homologies were observed. Results of the homol- 
ogy evaluation Indicate that the primers used in the group 2 PCR can selectively detect group 2 viral genes. 
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40 Note: NK and NP are negative controls. 
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Table 2 
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Group 1 PCR 
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Table 3 






Sample No. 


Nucleotide homoloav with clone C10-13 



40 



45 



3 
5 



85% 
88% 



60 



55 



As seen from the foregoing examples, the present invention has the following advantages: 
The cDNA sequences according to the present invention are specific to non-A non-B hepatitis, and polypep- 
tides which are produced by incorporating these genes into a protein expression system In microbial host cells 
such as E. coff can react immunologically with sera samples from a number of non-A non-B hepatitis patients, 
whereby a kit for diagnosing non-A non-B hepatitis is capable of preparing with markedly high sensitivity and 
Judging accuracy. Also, it is possible to make diagnosis of this disease using said sequences as a probe directly 
or other probes with higher specificity synthesized on the basis of the sequences. In addition, not only diagnose 
of the disease but also isolation of non-A non-B hepatitis-specific genes can be accomplished by employing a 
gene amplification method (PCR method). 
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Sequence Listing 



SEQ ID NO: 1 

SEQUENCE LENGTH: 763 base pairs 
SEQUENCE TYPEioucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

CG CAG TCA TTC CAA GTG GCC CAT CTA CAC GCT CCC ACT GGC AGC GGC 47 
Gin Ser Pbe Gin Val Ala His Leu His Ala Pro Tbr Gly Ser G I y 
15 10 15 

AAG AGT ACT AAA GTG CCG GCT GCA TAT GCC AGC CAA GGG TAC AAG GTG 95 
Lys Ser Tbr Ljs Val Pro Ala Ala Tyr Ala Ser Gin Gly Ty r Lys Val 
20 25 30 

CTC GTC CTC AAC CCG TCC GTT GCC GCC ACC TTA GGT TTT GGA GCG TAT 14 3 
Leo Val Leu Asn Pro Ser Val Ala Ala Tbr Leo Gly Pbe Gly Ala Tyr 
35 40 45 

ATG TCT AAG GCA CAT GGC ACC GAC CCC AAC ATC AGA ACT GGG GTA AGG 191 
Met Ser Lys Ala His Glj Tbr Asp Pro Asn He Arg Tbr Gly Val Arg 
50 55 60 

ACT ATC ACC ACA GGC GCC CCC ATC ACC TAC. TCC ACC TAC GGC AAG TTC 239 
Tbr He Tbr Tbr Gly Ala Pro He Tbr Tyr Ser Tbr Tyr Gly Lys Pbe 
65 70 75 

CTT GCC GAC GGT GGT TGT TCT GGG GGC GCT TAT GAC ATC ATA ATG TGT 287 
Leo Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He Met Cys 
80 85 90 95 

GAT GAG TGC CAC TCA ACT GAC GCG ACT TCC ATC TTG GGC ATC GGC ACG 335 
Asp Glo Cys His Ser Tbr Asp Ala Tbr Ser He Leo Gly He Gly Tbr 
iTb 105 110 

GTC CTG GAC CAA GCG GAG ACG GCT GGA GCA CGG CTC GTC GTG CTC GCC * 383 
Val Leu Asp Gin Ala Glu Tbr Ala Gly Ala Arg Leo Val Val Leo Ala 
115 120 . 125 
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ACC GCT ACG CCT CCG GGA TCG GTC ACC GTG CCA CAC CCG AAT ATT GAG 431 

Thr Ala Tbr Pro Pro G 1 y Ser Val Tbr Val Pro His Pro Asa lie Glu 
130 135 140 

GAG GTG GCC CTG TCT AAC ACT GGA GAG ATC CCC TTC TAT GGC AAA GGC 479 

Glu Val Ala Leo Ser Aso Thr Gly Gin He Pro Ph e Ty r G 1 y Lys G 1 y 

145 150 155 

ATC CCC ATT GAA GTC ATC AAG GGG GGA AGG CAT CTC ATT TTC TGC CAT 527 

He Pro He Glu Yal He Lys Gly Gly Arg His Leo lie Ph e Cys His 

160 165 170 175 

TCC AAG AAG AAG TGC GAC GAG CTC GCC GCC AAG TTG TCA GGC CTC GGG 575 

Ser Lys Lys Lys Cys Asp Glo Leu Ala Ala Lys Lea Ser Gly Leu Gly 
180 185 190 

ATT AAT GCT GTG GCA TAC TAC CGG GGT CTT GAT GTG TCC GTC ATA CCG 623 

He Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val lie Pro 
195 200 205 

ACC AGC GGA GAC GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC 671 

Thr Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly 
210 215 220 

TAT ACC GGC GAT TTT GAC TCA GTG ATC GAC TGT AAC ACA TGC GTC ACC 719 

Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr Cys Val Tbr 

225 230 235 



CAG ACA GTC GAC TTC AGC TTG GAC CCC ACC TTC ACC ATT GAG AC 763 
Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Gin 
40 240 245 250 



SEQ ID NO: 2 
« SEQUENCE LENGTH: 615 base pairs 

SEQUENCE TYPEloucleic acid 

STRANDEDNESS: doubted 

TOPOLOGY! linear 
» MOLECULE TYPE:cDNA to genomic 

C ACG CCC GGT TTG CCC GTG TGT CAA GAC CAC CTG GAG TTC TGG GAA GCG 49 
Thr Pro Gly Leu Pro Val Cys Glo Asp Bis Leu Glu Phe Trp Glu Ala 
55 1 5 10 15 
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CTC TTC ACA CCT CTC ACG CAC ATT GAT GCC CAC TTC CTC TCC CAG ACA 

Val Ph e Thr G I y Leu Thr His He Asp Ala His Pbe Lea Ser Gin Th r 

20 25 30 

AAG CAA GGA GGA GAC AAC TTC GCG TAT CTA ACG GCC TAC CAG GCC ACA 

Lys Gin Gl; Glj Asp Aso Phe Ala Ty r Lea Thr Ala Ty r Glo Ala Thr 
35 40 45 

GTG TGC GCT AGG GCA AAG GCC CCT CCT CCC TCG TGG GAT GTG ATG TGG 

Val C;s Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp 
50 55 60 

AAA TGT CTA GCT AGG CTG AAG CCT ACA CTA ATT GGT CCT ACC CCC CTC 

lys Cys Lea Ala Arg Leo Ljs Pro Thr Leu He Gl; Pro Thr Pro Lett 

65 70 75 80 

CTG TAC CGC TTG GGT GCC GTG ACC AAC GAG GTT ACC CTG ACG CAC CCC 

Leu Tyr Arg Leu Gl; Ala Val Thr As n Glu Val Thr Leu Thr His Pro 

85 90 95 

GTG ACG AAA TAC ATC GCC ACG TGC ATG CAA GCT GAC CTC GAG ATC ATG 

Val Thr L;s Tyr He Ala Thr Cys Met Gin Ala Asp Leu Glo He Met 

100 105 110 

ACG AGC ACA TGG GTC CTA GCA GGG GGG GTG CTA GCC GCC GTG GCA GCT 

Thr Ser Thr Trp Val Leo Ala Gly Gly Val Leu Ala Ala Val Ala Ala 
IIS 120 125 

TAC TGC CTG GCA ACC GGC TGT GTT TCC ATC ATC GGC CGC CTA CAC CTG 

Tyr Cys Leo Ala Thr Gly Cys Val Ser lie He Gly Arg Lett Bis Lett 
130 13$ 140 

AAT GAT CAA GTG GTT GTG ACT CCT GAC AAA GAA ATC TTA TAT GAG GCC 

Aso Asp GIs Val Val Val Thr Pro Asp Lys Glo He Leo Tyr Git Ala 

145 150 155 160 

TTT GAT GAG ATG^CAA GAA TGC GCC TCC AAA GCC GCC CTC ATT GAG GAA 

Pbe Asp Glo Met Glu Glo Cys Ala Ser Lys Ala Ala Leo He Glo Glu 

165 170 175 

GGG CAG CCG ATG GCG GAG ATG CTC AAG TCT AAG ATA CAA GGC CTC CTA 

Gly Gin Arg Met Ala Glo Met Leu Lys Ser Lys lie Glo Gly Leu Leu 

180 185 190 
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CAA CAG GCC ACA AGA CAG GCC CAA GAC ATA CAG CCA GC 
G I d Gin Ala Tbr Are G I □ Ah Gin Asp lie Gin Pro 
195 200 



SEQ ID NO: 3 

SEQUENCE LENGTH: 771 base pairs 
SEQUENCE TYPE: duc I e ic acid 
STRANDEDNESS: doable 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

GT GAG CGA GCC TCA GGA ATG TTT GAC ACT GTA GTG CTC TGT GAG TGC 
Gin Arg Ala Ser G 1 y Met Phe Asp Ser Val Val Leo Cys Glu Cys 
15 10 15 

TAT GAC GCA GGG GCT GCA TGG TAC GAG CTT ACA CCA GCG GAG ACC ACC 
Tyr Asp Ala Glj Ala Ala Trp Tyr Glu Leu Tbr Pro Ala Glo Thr Tbr 
20 25 30 

GTC AGG CTC AGA GCG TAT TTC AAC ACA CCT GGC TTG CCT GTG TGT CAA 
Val Arg Leo Arg Ala Tyr Phe Asn Tbr Pro Gly Leu Pro Val Cys Gin 
35 40 45 

GAC CAT CTT GAG TTC TGG GAG GCA GTT TTC ACC GGC CTC ACA CAC ATA 
Asp His Leu Glu Pbe Trp Glu Ala Val Phe Thr Gly Leu Tbr His lie 
50 55 60 

GAT GCC CAC TTC CTT TCC CAG ACA AAG CAA GCA GGG GAC AAT TTC GCA 
Asp Ala His Pbe Leo Ser Gin Tbr Lys Gin Ala Gly Asp Asn Pbe Ala 
65 70 75 

TAC TTG ACA GCC TAC CAG GCT ACA GTG TGC GCC AGA GCC AAA GCC CCT 
Tyr Leu Thr Ala Tyr Gin Ala Tbr Val Cys Ala Arg Ala Lys Ala Pro 
80 85 90 95 

CCC CCG TCC TGG GAC GTC ATG TGG AAG TGC CTG ACT CGG CTC AAG CCC 
Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro 
100 105 110 

ACG CTT GTG GCC CCT ACA CCC CTT CTG TAC CGT TTA GGC TCT GTT ACT 
Thr Leu Val Ala Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val Thr 
115 120 125 
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AAC GAG GTC ACC CTC ACA CAT CCT GTG ACG AAA TAC ATC GCC ACT TGC 431 
Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tj r He Ala Thr Cys 
130 135 140 

ATG CAA GCT GAC CTT GAG GTC ATG ACC AGC ACG TGG GTC CTA GCT GGG 479 
Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leo Ala Gly 
145 150 155 

GGG GTC .TTG GCA GCC GTC GCC GCG TAT TGC CTG GCG ACT GGG TGT GTC 527 
Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val 
160 165 170 175 

TCC ATC ATC GGC CGC TTG CAC ATC AAT CAG CGA GCC GTC GTT GCA CCA 575 
Ser He He Gly Arg Leu His He Asa Gin Arg Ala Val Val Ala Pro 
180 185 190 

GAC AAG GAG GTC CTT TAT GAG GCT TTT GAT GAG ATG GAG GAG TGT GCC 623 
Asp Lys Glu Val Leu Tyr Glu Ala Pbe Asp Glu Met Glu Glu Cys Ala 
195 200 205 

TCT AAA GCG GCT CTC ATT GAA GAG GGG CAG CGG ATA GCC GAG ATG CTG 671 
Ser Lys Ala Ala Leu lie Glu Glu Gly Gin Arg He Ala Glu Met Leu 
210 215 220 

AAG TCC AAG ATC CAA GGC TTA TTG CAG CAA GCC TCT AAA CAG GCC CAG 719 
Lys Ser Lys He Gin Gly Leu Leu Gin Gin. Ala Ser Lys Gin Ala Gin 
225 230 235 

GAC ATA CAA CCC GCT GTG CAG CCT CAT GGC CCA AGG TGG AGC AAT TCT 767 
Asp He Gin Pro Ala Val Gin Pro His Gly Pro Arg Trp Ser Asn Ser 
240 245 250 255 

GGG C 771 
Gly 



SEQ ID NO: 4 
so SEQUENCE LENGTH: 630 base pairs 

SEQUENCE TYPE: duc I e i c acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
55 MOLECULE TYPE: cDNA to genomic RNA 
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C TGG TAT GAA CTT ACG CCT GCT GAG ACT ACG GTG AGA CTC CGG GCC TAT 49 
Trp Tjr Glu Leu Tb r Pro Ala Glu Thr Th r Val A r g Leo Arg Ala Tyr 
1 5 10 15 

TTC AAC ACG CCC GGC CTG CCT GTG TGT CAA GAC CAC CTG GAA TTC TGG 97 
Phe Asn Thr Pro G 1 y Lea Pro Val Cys Gin Asp Bis Leu Glu Phe Trp 
20 25 30 

GAG GCG GTC TTC ACA GGT CTC ACA CAC ATC GAT GCC CAC TTC CTC TCC 145 
Glu Ala Val Phe Thr Gly Leu Thr His He Asp Ala His Phe Leu Ser 
35 40 45 

CAG ACG AAG CAA GGA GGA GAT AAC TTT GCA TAT TTA ACA GCC TAC CAG 193 
Gin Thr Lys Gin Gly Gly Asp Asn Phe Ala Tyr Leu Thr Ala Tyr Gin 
50 55 60 

GCC ACA GTC TGC GCT AGG GCA AAG GCT CCC CCT CCT TCG TGG GAC GTG 241 
Ala Thr Val Cys Ala Art Ala Lys Ala Pro Pro Pro Ser Trp Asp Val 
65 70 75 80 



ATG TGG AAG TGT TTG ATT AGG CTC AAA CCT ACA CTG ACT GGT CCT ACC 289 

30 Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu Thr Gly fro Thr 

85 90 95 

CCC CTC CTG TAC CGC TTG GGT GCC GTG ACC AAC GAG GTT ACC CTG ACT 337 

3s Pro Leo Leu Tyr Arg Leu Gly Ala Val Thr Asn Glu Val Thr Leu Thr 
100 105 110 

CAC CCC ATG ACG AAA TAT ATC GCC ACT TGT ATG CAA GCT GAT CTT GAG 385 

40 His Pro Met Thr Lys Tyr He Ala Thr Cys Met Gin Ala Asp leu Glu 
115 120 125 

ATC ATG ACA AGC ACA TGG GTC TTG GCG GGG GGG GTG CTA GCC GCT GTG 433 

4s He Met Thr Ser Thr Trp Val Leu Ala Gly Gly Val. Leu Ala Ala Val 

130 135 140 

GCA GCT TAC TGC CTA GCG ACC GGC TGC ATT TCC ATC ATT GGC CGC CTT 481 

» Ala Ala Tyr Cys Leu Ala Thr Gly Cys He. Ser He He Gly Arg Leu * 

145 150 155 160 



CAC CTG AAT GAT CGG GTG GTC GTG ACC CCT GAT AAG GAA ATT TTA TAT 529 
His Leu Asn Asp Arg Val Val Val Thr Pro Asp Lys Glu He Leu Tyr 
165 170 175 
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GAG GCC TTT GAT GAG. ATG GAA GAG TGC GCC TCC AAA GCC GCC CTC ATT 577 
Glu Ala Fhe Asp Glo Met Glu Gin Cys Ala Ser Lys Ala Ala Leo He 
1 8 0 IS5 1 9 0 

GAG GAA GGG CAG CGG ATG GCG GAG ATG CTG AAG TCT AAA ATA CAA GGC 625 
Glo Glo Gly GU Art Met Ala Glu Met Lea Lys Ser Ljs He Gin Gly 
1 95 200 205 

CTC TT 630 
Leu 



SEQ ID NO! 5 

SEQUENCE LENGTH: 1426 base pairs 
20 SEQUENCE TYPEloucIeic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPElcDNA to genomic RNA 



55 



GGG ATC AAC CCT AAC ATC AGG ACC GGA GTA CGG ACC GTG ACC ACC GGG 48 

Gly lie Aso Pro Asa He Arg Th r Gly Yal Arg Tkr Val Thr Thr Gly 

15 10 15 

GAC TCC ATC ACC TAC TCC ACT TAT GGC AAG TTT ATC GCA GAT GGA GGT 96 

Asp Ser He Thr Tyr Ser Thr Tyr Gly Lys Pie He Ala Asp Gly Gly 

20 25 30 

TGC GCA CAT GGT GCC TAT GAC GTC ATC ATA TGC GAC GAA TGC CAT TCA 144 

Cys Ala Bis Gly Ala Tyr Asp Yal He He Cys Asp Glo Cys Bis Ser 

35 40 45 

GTG GAC GCT ACT ACC ATC CTT GGC ATT GGA ACA GTC CTT GAC CAG GCT 192 

Yal Asp Ala Thr Thr He Leu Gly He Gly Thr Yal Leo Asp Glo Ala 
50 55 60 

GAG ACC GCA GGT GCC AGG CTA GTG GTT TTA GCC ACA GCC ACG CCA CCC 240 

Glo Thr Ala Gly Ala Arg. Leo Val Yal Leo Ala Thr Ala Thr Pro Pro 

65 70 75 80 . 

GGT ACG GTA ACA ACT CCC CAC GCT AAC ATA GAG GAG GTG GCC CTT GGT 288 

Gly Thr Val Thr Thr Pro His Ala Aso He Glo Glo Val Ala Leu Gly 

85 90 95 
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CAC GAA GGC GAG ATT CCT TTT TAT GGC AAG GCT ATT CCC CTA GCT TTC 
His G I u G 1 y C I o He Pro P h e Ty r Gly L y s Ala He Pro Leo Ala Phe 
100 105 110 



336 



10 



ATC AAG GGG GGC AGA CAC CTA ATT TTT TGC CAT TCA AAG AAG AAG TGC 
lie Ly s Gly Gly Arg His Lea lie Phe Cys His Ser Lys Lys Lys Cys 
IIS I20 12$ 



384 



f5 



GAC GAG CTC GCA GCA GCC CTT CGG GGC ATG GGT ATC AAT GCC GTT GCC 
Asp Glu Leo Ala Ala Ala Leo Arg Gly Met Gly He As n Ala Val Ala 
130 135 140 



432 



20 



TAC TAC AGG GGT CTC GAC GTC TCC GTT ATA CCA ACT CAA GGA GAC GTG 
Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Gin Gly Asp Val 
I4S 150 155 160 



480 



25 



GTG GTT GTC GCC ACC GAT GCC CTA ATG ACT GGA TAC ACC GGT GAC TTT 
Val Val Val Ala Tlr Asp Ala Leu Met Thr Gly Tyr Th r Gly Asp Phe 
165 170 175 



528 



30 



GAC TCT GTC ATC GAC TGC AAC GTT GCA GTC ACT CAG ATT GTT GAC TTT 
Asp Ser Val He Asp Cys Asa Val Ala Val Thr Gin He Val Asp Phe 
180 185 190 



576 



35 



AGC CTA GAC CCA ACT TTT ACC ATC ACC ACT CAA ACC GTC CCT CAG GAG 
Ser Leo Asp Pro Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Glu 
195 200 205 



624 



40 



GCT GTC TCC CGT AGT CAA CGT AGA GGG AGA ACT GGG AGG GGG CGA CTG 
Ala Val Ser Arg Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leo 
210 215 220 



672 



45 



GGC ACT TAC AGG TAT GTC TCG TCA GGC GAG AGG CCG TCT GGG ATG TTC 
Gly Thr Tyr Arg Tyr Val Ser Ser Gly Gin Arg Pro Ser Gly Met Phe 
225 230 235 240 



720 



so 



GAC AGC GTA GTA CTC TGC GAG TGC TAT GAT GCC GGG GCA GCC TGG TAC 
Asp Ser Val Val Lea Cys Glo Cys Tyr Asp Ala Gly Ala Ala Tip Tyr 
245 250 255 



768 
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GAG CTT ACA CCT GCT GAG ACC ACA GTG AGA CTC CGG GCT TAT TTC AAC 
Glu Leo Thr Pro Ala Glu Thr Thr Val Arg Leo Arg Ala Tyr Phe Asa 
260 265 270 



816 
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ACC CCC CGT TTC CCC GTG TGT CAA GAC CAC CTG GAG TTC TGG GAA GCG 
Thr Pro Glj Leu Pro Val Cys Glo Asp Bis Leo Glo Phe Trp Glo Ala 
275 280 28$ 

GTC TTC ACA GGT CTC ACG CAC ATT GAT GCC CAC TTC CTC TCC CAG ACA 

Val Phe Thr Gly Leu Tbr Bis lie Asp Ala Bis Phe Leo Ser Glo Thr 
290 295 300 

AAG CAA GGA GGA GAC AAC TTC GCG TAT CTA ACG GCC TAC CAG GCC ACA 

Ljs Glo Glj Gly Asp Aso Phe Ala Tjr Leo Thr Ala Tj r Glo Ala Thr 
305 310 315 320 

GTC TGC GCT AGG GCA AAG GCC CCT CCT CCC TCG TGG GAT GTG ATG TGG 

Val Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp 
325 330 335 

AAA TGT CTA GCT AGG CTG AAG CCT ACA CTA ATT GGT CCT ACC CCC CTC 

Lys Cys Leu Ala Arg Leu Lys Pro Thr Leo He Glj Pro Thr Pro Leu 

340 345 350 

CTG TAC CGC TTG GGT GCC GTG ACC AAC GAG GTT ACC CTG ACG CAC CCC 

Leu Tyr Arg Leu Gly Ala Val Thr Aso Glo Val Thr Leu Thr Bis Pro 
355 360 365 

GTG ACG AAA TAC ATC GCC ACG TGC ATG CAA GTG AAC CTC GAG ATC ATG 

Val Thr Lys Tyr He Ala Thr Cys Met Gin Val Aso Leo Glu He Met 
370 375 380 

ACG AGC ACA TGG GTC CTA GCA GGG GGG GTG CTA GCC GCC GTG GCA GCT 

Thr Ser Thr Trp Val Leo Ala Gly Gly Val Leo Ala Al.a Val Ala Ala 
385 390 395 400 

TAC TGC CTG GCA ACC GGC TGT GTT TCC ATC ATC CGC CGC CTA CAC CTG 

Tyr Cys Leo Ala Thr Gly Cys Val Ser He lie Gly Arg leo His Leo 
405 410 415 

AAT GAT CAA GTG GtTGTG ACT CCT GAC AAA GAA ATC TTA TAT CAG GCC 

Aso Asp Gin Val Val Val Thr Pro Asp Lys Glo He Leo Tyr Glo Ala 

420 425 430 

TTT GAT GAG ATG GAA GAA TGC GCC TCC AAA GCC GCC CTC ATT CAG GAA 

Phe Asp Glo Met Glu Glo Cys Ala Ser Lys Ala Ala Leo lie Glo Glo 
435 440 445 
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GGG CAG CGG ATG GCG GAG ATG CTC AAG TCT AAG ATA CAA GGC CTC CTA 1392 

Gly Gin Arg Met Ala G I a Met Leo Ly s Ser Ly s He Clo Gly Leu Leo 
450 455 460 

CAA CAG GCC ACA AGA CAG GCC CAA GAC ATA CAG C 1426 

Gin Glo Ala Thr Arg Gin Ala Glo Asp Tie Glo 
465 470 475 



a SEQ ID NO: 6 

SEQUENCE LENGTH: 855 base pairs 

SEQUENCE TYPE: ouc !e ic acid 

STRANDEDNESS: double 
20 TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

CG CAG ACA TTC CAA GTG GCC CAT CTG CAC GCT CCC ACT GGT AGC GGC 47 
25 Gin Thr Phe Glo Val Ala Bis Leu Bis Ala Pro Thr Gly Ser Gly 
1 5 10 15 

AAG AGC ACT AAG GTG CCG GCT GCA TAT GCG GCC CAA GGG TAC AAG GTA 95 
so Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val 

20 25 30 

CTC GTC CTG AAC CCG TCC GTT GCC GCC ACT TTA GCC TTT GGG GCG TAC 143 
35 Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Ala Phe Gly Ala Tyr 
35 40 45 



ATG TCT AAG GCA CAT GGT GTC GAC CCT AAC ATC AGA ACT GGG GTG AGG 191 

Met Ser Lys Ala His Gly Val Asp Pro Asa lie Arg Thr Gly Val Arg 

50 55 60 

ACC ATC ACC ACG GGC GCT CCC ATC ACG TAC TCC ACC TAT GGT AAG TTC 239 

Thr He Tbr Thr Gly Ala Pro lie Thr Tyr Ser Thr Tyr Gly Lys Phe 

65 70 75 

CTT GCC GAC GGT GGT TGC TCT GGG GGC GCC TAT GAC ATC ATA ATA TGT 287 

Len Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He He Cys ' 

80 85 90 95 

GAT GAG TGC CAC TCA ACT GAC TCG ACA TCC ATC TTG GGC ATC GGC ACA 335 

Asp Clo Cys His Ser Thr Asp Ser Thr Ser He Leu Gly He Gly Thr 

100 105 110 
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GTC CTG GAC CAA GCG GAG ACG GCT GGA GCG CGG CTC GTC GTG CTC GCT 
Yal Lea Asp Gin Ala G 1 u Thr Ala G I y Ala Arg Leo Val Val Leo Ala 
115 120 125 

ACC GCT ACG CCT CCG GGA TCG GTC ACC GTG CCA CAT CCC AAT ATC GAG 
Tbr Ala Tkr Pro Pro Glj Ser Yal Thr Val Pro Bis Pro Aso He Glu 
130 135 140 

GAG GTG GCC CTG TCC ACC ACT GGA GAG ATT CCC TTC TAC GGC AAA GCT 
Glu Val Ala Leo Ser Tbr Thr Gly Glu He Pro Phe Tyr Gly Ly s Ala 
H5 150 155 

ATC CCC ATC GAG ACA ATC AAG GGG GGG AGG CAT CTC ATC TTC TGC CGT 
He Pro lie Glo Thr lie Lys Gly Gly Arg His Leu He Phe Cy s Arg 
160 165 170 175 

TCC AAG AAG AAG TGT GAC GAG CTC GCT GGA AAG CTG TCA GCC CTC GGA 
Ser Lys Lys Lys Cys Asp Glu Leo Ala Gly Lys Leu Ser Ala Leu Gly 
180 1S5 190 

ATC AAC GCT GTA GCG TAC TAC CGG GGT CTT GAT GTA TCC GTC ATA CCG 
He Aso Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro 
195 200 205 

ACC AGC GGA GAC GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC 
Thr Ser Gly Asp Val Val Val Val Ala Thr. Asp Ala Leu Met Tbr Gly 
210 215 220 

TAC ACC GGT GAC TTT GAT TCA GTG ATC GAC TGC AAT ACA TGT GTC ACC 
Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Aso Thr Cys Val Thr 
225 230 235 

CAG ACA GTC GAC TTC AGC TTG GAC CCT ACC TTC ACC ATT GAG ACG ACG 
Glo Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu Thr Thr 
240 245 250 255 

ACC GTG CCT CAA GaTgCG GTG TCA CGC TCG CAG CGG CGA GGC AGA ACT 
Thr Yal Pro Glo Asp Ala Val Ser Arg Ser Glo Arg Arg Gly Arg Thr - 
260 265 270 

GGT AGG GGT AGA GGG GGC ATA TAC AGG TTT GTG ACT CCA G 
Gly Arg Gly Arg Gly Gly He Tyr Arg Phe Val Thr Pro 
275 280 
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SEQ ID HO: 7 

SEQUENCE LENGTH: 3 15 base pairs 
SEQUENCE TYPE : ou c I e i c acid 
STRANDEDNESS : dou b le 
TOPOLOGY: linear 

MOIECOLE TYPE:cDNA to geoonic RNA 

GAC GAG CTC GCC GCA AAG CTG TCA GGC CTC GGA GTC AAT GCT GTG GCA 

Asp Gin Leu Ala Ala L j s Leu Ser G 1 y Leo G ! y Val Aso Ala Val Ala 

1 5 10 15 

TAC TAC CGG GGT CTC GAT GTG TCT GTC ATA CCG ACG AGC GGG GAC GTC 

Ty r Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val 

20 25 30 

GTT GTT GTG GCA ACA GAC GCT CTA ATG ACG GGC TAT ACC GGC GAC TTT 

Val Val Val Ala Thr Asp Ala Leu Met Thr G 1 y Tyr Thr Gly Asp Phe 

35 40 45 

GAC TCG GTG ATC GAC TGC AAT ACA TGT GTC ACC CAA ACA GTC GAT TTC 

Asp Ser Val He Asp Cys Asd Thr Cys Val Thr Gin Thr Val Asp Phe 

50 55 60 

AGC TTG GAC CCT ACT TTC ACC ATT GAG ACG ACG ACC GTG CCC CAA GAC 

Ser Leu Asp Pro Thr Phe Thr lie Glu Thr. Thr Thr Val Pro Gin Asp 

65 70 . 75 80 

GCG GTG TCG CGC TCG CAG CGG CGA GGC AGG ACT GGT AGG GGC AGC GTG 

Ala Val Ser Arg Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Val 

lb 90 95 

GGC ATA TAC AGG TTT GTG ACT CCC GAG 

Gly He Tyr Arg Phe Val Thr Pro Gin 

100 105 
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SEQ ID NO: 8 

SEQUENCE LENGTH: 91 1 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 



GT GAT GAG CTC GCC GCA AAG CTC TCA AGC CTC GGA CTC AAC GCT GTA 
Asp Glu Leo Ala Ala Lys Leu Ser Ser Leu Gly Leu Asn Ala Yal 

15 10 15 

GCA TAT TAC CGG GGT CTT GAT GTG TCC GTC ATA CCG ACT AGT GGA GAC 

Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp 

20 25 30 

GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC TAT ACC GGC GAC 

Yal Yal Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp 
35 40 45 

TTT GAC TCA GTG ATC GAC TGT AAC ACA TGT GTC ACC CAG ACA GTT GAT 

Phe Asp Ser Val He Asp Cys Aso Thr Cys Val Thr Gin Thr Val Asp 

50 55 60 

TTC AGC TTG GAT CCA ACC TTC ACC ATT GAG ACG ACG ACC GTG CCT CAA 

Phe Ser Leu Asp Pro Thr Phe Thr He Glu. Thr Thr Thr Val Pro Gin 
65 70 75 



GAC GCG GTG TCG CGC TCG CAG CGG CGA GGT AGG ACT GGC AGG GGC AGG 
Asp Ala Val Ser Arg Ser Gin Arg Afg Gly Arg Thr Gly Arg Gly Arg 
80 85 90 95' 



GGC GGC ATC TAT AGG TTT GTG ACT CCA GGA GAA CGG CCC TCG GGC ATG 

Gly Gly lie Tyr Arg Phe Val Thr Pro Gly Glu Arg Pro Ser Gly Met 

100 105 110 

TTC GAT TCC TCG GTC CTG TGT GAG TGT TAT GAC GCG GGC TGT GCT TGG 

Phe Asp Ser Ser Yri-Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp 

115 120 125 



TAT GAG CTC ACG CCC GCC GAG ACC ACG GTT AGG TTG CGG GCT TAC CTA 
Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Leu 
130 135 140 
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AAT ACA CCA GGG TTG CCC GTC TGC CAG GAC CAT CTG GAG TTC TGG GAG 

Asd Tbr Fro Gly Leu Pro Val Cys Glo Asp His Leu Glu Pie Trp Glu 
14$ ISO 15$ 

GGC GTC TTC ACA GGC CTC ACC CAC ATA GAT GCC CAT TTC TTG TCT CAG 

G 1 y Val Phe Tlr Gly Leu Tar Bis He Asp Ala Bis Phe Leu Ser Gin 

160 16$ 170 175 

ACT AAG CAG GCA GGA CAC AAC TTT CCC TAC CTG GTG GCA TAC CAA GCT 

Tbr Lys Gin Ala Gly Bis Asd Phe Pro Tjr Leu Val Ala Tyr Glo Ala 

180 1 S 5 190 

ACA GTG TGC GCC AGG GCT CAG GCT CCA CCT CCA TCG TGG GAC CAA ATG 

Th r Val Cys Ala Are Ala Glo Ala Pro Pro Pro Ser Trp Asp Gin Met 

195 200 205 

TGG AAG TGT CTC ATA CGG CTG AAA CCT ACG CTG CAC GGG CCA ACA CCC 

Trp Lys Cys Leu He Arg Leu Lys Pro Tbr Leu His Gly Pro Th r Pro 

210 215 220 

CTG CTG TAT AGG CTA GGA GCC GTG GAA AAT GAG GTC ACC CTC ACA CAC 

Leu Leo Tyr Arg Leu Gly Ala Val Glu Aso Glu Val Tbr Leu Th f His 
22$ 230 235 

CCC ATA ACC AAA TTC ATC ATG GCA TGC ATG. TCG GCT GAT CTG GAG GTC 

Pro Me Tbr Lys Phe lie Met Ala Cys Met. Ser Ala Asp Leo Glu Val 

240 245 250 255 

GTC ACC AGC ACC TGG CTG CTG GTG GGC GGA GTC CTT GCA GCT CTG GCC 

Val Tbr Ser Tbr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala 

260 265 270 

GCA TAT CGC CTG ACA ACA GGC AGC GTG GTC ATC GTG GGT AGG ATC ATC 

Ala Tyr Arg Leo Tbr Tbr Gly Ser Val Val He Val Gly Arg lie He 

275 280 285 

TTG TCT GGG AGG CCG GCT GTC ATT CCC GAC AGG GAA GTC CTT TAC CGG 

Leu Ser Gly Arg Pro Ala Val He Pro Asp Arg Glu Val Leo Tyr Arg 

290 295 300 
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SEQ ID NO: 9 

SEQUENCE LENGTH: 4 89 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: doable 
TOPOLOGY: linear 

MOLECULE TYPElcDNA to genomic RNA 

CG ACA ACC GTG CCC CAA GAC GCG GTG TCG CGC TCA CAA CGG CGG GGT 47 
Thr Thr Va I Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Glj 
15 10 15 

AGG ACA GGT AGG GGC AGG AGA GGC ATC TAC AGA TTT GTG ACT CCG GGA 95 
Arg Thr Gly Arg Gly Arg Arg Gly lie Tjr Arg Pbe Val Thr Pro G 1 y 
20 25 30 

GAA CGG CCC TCG GGC ATG TTC GAT TCT TCG GTC CTG TGT GAG TGC TAT 143 
G I u Arg Pro Ser Gly Met Pie Asp Ser Ser Val Leu Cys Glu Cys Tyr 
35 40 45 

GAC GCG GGC TGC GCT TGG ATC GAG CTC ACG CCC GCC GAG ACC TCA GTT 191 
Asp Ala Gly Cys Ala Trp He Glo Leu Tbr Pro Ala Glu Thr Ser Val 
50 55 60 

AGG TTG CGG GCT TAC CTA AAT ACA CCA GGG TTG CCC GTC TGC CAG GAC 239 
Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
65 70 75 

CAC CTG GAA TTC TGG GAG AGC GTC TTC ACA GGC CTC ACC CAT ATA GAT 287 
His Leu Glo Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His lie Asp 
80 85 90 95 

GCC CAC TTC TTG TCC CAG ACC AAG CAG GCA GGA GAC AAC TTC CCC TAC 335 
Ala His Phe Lea Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
100 105 110 

CTG GTA GCA TAC CAA GCT ACA GTG TGC GCC AGG GCC CAG GCT CCA CCA 383 
Leu Val Ala Tyr Gin Ah Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
115 120 125 

CCA TCG TGG GAT CAA ATG TGG AAG TGT CTC ATA CGG CTG AAA CCT ACG 431 
Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg- Leu Lys Pro Thr 
130 135 140 
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CTA CAC CGG CCA ACA CCC CTG TTG TAT AGG CTG GGA GCC GTC CAA AAT 
Leo His G I y Pro Tbr Pro Leo Leo Ty r Arg Lea G I y Ah V a I Glo Aso 
145 150 155 

GAG GTC ACC C 
Glo Yal Thr 
160 



SEQ ID NO: 1 0 

SEQUENCE LENGTH: 1076 base pairs 
SEQUENCE TYPE : nuc 1 e i c acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPEicDNA to genomic RNA 



GT GGT CTC CTG GGT GCC ATC GTG GTC AGC CTA ACG GGC CGC GAC AAG 
Glj Leu Leu Gly Ala He Yal Val Ser Leu Tbr Glj Arg Asp Lys 
15 10 15 

AAC CAG GTC GAG GGG GAG GTT CAG GTG GTC TCC ACC GCA ACG CAA TCT 
Asn Gin Val Glu Gly Glu Yal Gin Val Val Ser Tbr Ala Tbr Gin Ser 
20 25 30 

TTC CTG GCG ACC TGC GTC AAT GGC GTG TGT TGG ACC GTC TAC CAT GGC 
Pbe Leu Ala Tbr Cjs Val Asn Gly Val Cys .Trp Tbr Val Tyr His Gly 
35 40 45 

GCC GGC TCG AAA ACC CTG GCC GGC CCG AAG GGT CCA GTC ACC CAA ATG 
Ala Gly Ser Lys Tbr Leo Ala Gly Pro lys Gly Fro Val Tbr Gin Met 
50 55 60 

TAC ACT AAT GTG GAC CAG GAC CTC GTC GGC TGG CCG GCG CCC TCC GGG 
Tyr Tbr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Ser Gly 

65 70 75 

GCG CGG TCC TTG ACT CCA TGC ACC TGC GGC AGC TCG GAC CTT TAC TTG 
Ala Arg Ser Leu Tbr Pro Cys Tbr Cys Gly Ser Ser Asp Leu Tyr Leu 
80 85 90 95 



GTC ACG AGG CAT GCT GAT GTC ATT CCG GTG CGC CGG CGG GGC GAT AGC 
Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser 
100 105 110 
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AGG GGG AGC CTG CTT TCC CCC AGG CCC CTC TCC TAC TTG AAG GGC TCC 

Arc G 1 t Ser Leu Leu Set Pro Arg Pro Leo Ser Tyr Leo Ly s G I y Ser 

115 120 125 

TCA GGT GGT CCA CTG CTT TGC CCC TCG GGG CAC ATT GTG GGC ATC TTC 

Ser Glj Gly Pro Leu Leu Cys Pro Ser Gly His lie Val Gly He Phe 
130 135 HO 

CGG GCT GCC GTG TGC ACC CGG GGG GTT GCG AAG GCG GTG GAC TTT GTA 

Arc Ala Ala Yal Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val 

145 150 155 

CCT GTC GAG TCT ATG GAA ACT ACT ATG CGG TCT CCG GTC TTC ACG GAT 

Pro Yal Glu Ser Met Glu Thr Thr Met Arg Ser Pro Yal Phe Thr Asp 

1(0 165 170 175 

AAT TCA TCC CCC CCG GCC GTA CCG CAG ACA TTC CAA GTG GCC CAT CTG 

Asn Ser Ser Pro Pro Ala Val Pro GId Thr Phe Glo Val Ala Bis Leu 

ISO 1S5 190 

CAT GCC CCC ACT GGC AGC GGC AAG AGC ACT AAG GTG CCG GCT GCA TAC 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr 

195 200 205 

GCA GCC CAG GGA TAC AAG GTA CTC GTC CTG AAC CCG TCC GTT GCC GCC 

Ala Ala Gin Gly Tyr Lys Val leu Val Leu Asn Pro Ser Val Ala Ala 
210 215 .220 

ACC TTA GGT TTT GGA GCA TAT ATG TCC AAG GCA CAT GGT GTC GAC CCT 

Thr Leo Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro 

225 230 235 

AAC ATC AGG ACT GGG GTA AGG ACC ATC ACT ACG GGC GCC CCC ATT ACA 

Asn lie Arg Thr Gly Val Arg Thr He Thr Thr Gly Ala Pro He Thr 

240 245 250 255 

TAC TCC ACC TAT^GGC AAG TTT CTT GCC GAC GGT GGT TGC TCC GGG GGC 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly 

2(0 2(5 270 

GCC TAT GAC ATC ATA ATA TGT GAT GAC TGC CAC TCA ACT GAC TCG ACT 

Ala Tyr Asp lie He lie Cys Asp Glu Cys His Ser Thr Asp Ser Thr 

275 280 285 
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TCC ATT TTG GGC ATT GGC ACG GTC CTG GAC CAA GCG GAG ACG GCT GGA 
Ser lie leu Gly lie Gly Th r Val Leu Asp Gin Ala Glu Tbr Ah Gly 
290 29S 300 

GCG CGG CTC GTC GTG CTC GCC ACC GCT ACG CCT CCA GGA TCG GTC ACT 
Ala Ar* Leo Yal Tal Leu Ala Tbr Ala Thr fro Pro Gly Ser Val Tbr 
305 310 315 

GTG CCT CAT CCC AAC ATC GAG GAG GTG GCC TTG TCC AGC ACT GGA GAG 
Val Pro His Pro Aso He Glo Glo Val Ala Leo Ser Ser Tbr Gly Glu 
320 325 330 335 

ATT CCC TTC TAT GGC AAA GCC ATC CCC ATT GAG ACC ATC AAG GGG GGA 
He Pro Phe Tyr Gly Lys Ala lie Pro He Glu Tbr He Lys Gly Gly 
340 345 350 

AGG CAT CTC ATT TTC TGC CAC 
Ar g His Leu He Pbe Cys His 
355 

Sequence Listing 



SEQ ID NO: 11 

SEQUENCE LENGTH: 284 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

GTC GAC CCC AAT ATT AGA ACT GGG GTA AGG ACC ATC ACC ACG GGC GCT 
Val Asp Pro Aso He Arg Tbr Gly Val Arf Tbr lie Tbr Tbr Gly Ala 
15 10 15 

CCC ATT ACG TAT TCT ACC TAT GGC AAA TTC CTT GCC GAC GGT GGT TGC 
Pro lie Tbr Tyr Ser Tbr Tyr Gly Lys Pbe Leu Ala Asp Gly Gly Cys 
20 25 30 

TCT GGG GGC GCC TAT GAC ATC ATA ATC TGT CAT GAG TGC CAC TCA ACT 
Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Tbr 
35 40 45 

GAC TCG ACT TCC ATC TTG GGT ATC GGC ACA GCC CTG GAC CAA GCG GAG 
Asp Ser Tbr Ser He Leu Gly lie Gly Tbr Ala Leu Asp Gin Ala Glu 
50 55 60 
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ACG CCT GGA GCA CGG CTT GTC GTG CTC GCC ACC GCT ACG CCT CCA GGG 240 

Thr Ala Gly Ala Arg Leu Val Val Leu Ala Th r Ala Tkr Pro Pro G I y 

65 70 7$ SO 

TCG GTC ACC GTG CCG CAT CCC AAC ATC GAG GAG GTA GCC TTG CC 284 

Ser Val Tkr Val Pro His Pro Aso He Glo G 1 u Val Ala Leu 
85 90 



SEQ ID NO: 12 

SEQUENCE LENGTH: 641 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDHA to genomic RNA 

G GAC AAC TCA TCT CCC CCG GCG GTA CCG CAG ACA TTC CAG GTG GCC CAT 49 
Asp Aso Ser Ser Pro Pro Ala Val Pro Gin Tbr Phe Glo Val Ala His 
1 5 10 15 

CTA CAC GCT CCC ACT GGC AGC GGC AAG AGC ACT AAG GTG CCG GCT GCA ' 97 
Lea Bis Ala Pro Tbr Gly Ser G 1 y Lys Ser Tbr Ly s Val Pro Ala Ala 
20 25 30 

TAT GCA GCC CAA GGG TAC AAA GTA CTC GTC.CTG AAC CCG TCC GTT GCC 145 
Ty r Ala Ala Glo Gly Ty r Lys Val Leu Val Leu Aso Pro Ser Val Ala 
35 40 45 

GCC ACC TTA AGT TTC GGG GCG TAT ATG TCC AAG GCA CAT GGT GTT GAC 193 
Ala Thr Leo Ser Pbe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp 
50 55 60 

CCT AAT ATC AGA ACT GGG ACA AGG ACC ATC ACC ACG GGC GCT CCC ATC 241 
Pro Aso He Arg Tbr Gly Tkr Arg Tbr He Tbr Tbr Gly Ala Pro lie 
65 70 75 80 

ACG TAC TCC ACC TATCGC AAG TTC CTT GCA GAC GGT GGT TGC TCC GGA 289 
Tkr Tyr Ser Tkr Tyr Gly Lys Pbe Leu Ala Asp Gly Gly Cys Ser Gly 
85 90 95 

GGC GCC TAT GAC ATC ATA ATA TGC GAT GAG TGC CAC TCA ACA GAC TCG 337 
Gly Ala Tyr Asp He lie He Cys Asp Glo Cys His Ser Thr Asp Ser 
100 105 110 
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ACT TCC ATC TTA GGC ATT GGT ACG GTC CTG GAC CAA GCG GAG ACG GCT 385 
Thr Ser He Leo G I y lie Glj Thr Val Leu Asp Gin Ah Gin Thr Ala 
1 1 5 120 125 

GGA GCG CGA CTC GTC GTG CTC GCC ACC GCT ACG CCT CCA GGA TCG GTC 433 
G I y Ala Arg Leu Val Val Leo Ala Thr Ala Thr Pro Pro Glj Ser Val 
130 135 HO 



ACT GTG CCA CAT CCC AAC ATC GAG GAG GTG GCC CTG TCC AAC ACT GGA 481 
Thr Val Pro His Pro As n lie Gin G 1 u Val Ala Leu Ser Asd Thr Gly 
is 145 150 1 55 1 60 

GAG ATT CCC TTC TAT GGC AAA GCC ATC CCC ATT GAG GCC ATC AAG GGG 529 
Gh lie Pro Phe Tyr Gly Lys Ala He Pro He Glu Ala He Lys Gly 
20 1 65 170 175 

GGG AGG CAT CTC ATT TTC TGC CAT TCT AAG AAG AAG TGT GAT GAG CTC 577 
Gly Arg Bis Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu 
» 180 185 190 

GCC ACG AAG CTG TCG GCC CTC GGA CTC AAT GCT GTA GCG TAC TAC CGG 625 
Ala Thr Lys Leu Ser Ala Lea Gly Leu Asn Ala Val Ala Tyr Tyr Arg 
30 195 200 205 

GGT CTT GAT GTG TCC G 641 
Gly Leu Asp Val Ser 
210 



SEQ ID NO: 13 

SEQUENCE LENGTH: 4 32 base pairs 
SEQUENCE TYPE: dq c I e i c acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to geoonic RNA 

CA GGC GAG AGG CCG^CA GGG ATG TTT GAC AGC GTA GTG CTC TGT GAG 47. 
Gly Gin Arg Pro Thr Gly Met Phe Asp Ser Val Val Leu Cys Glu 
1 5 10 15 

TGC TAT GAT GCC GGG GCC GCC TGG TAC GAG CTT ACG CCT GCT GAG ACT 95 
Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr 
20 2S 30 
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ACG CTG AGA CTC CGG GCT TAT TTC AAC ACG CCC GGT TTG CCT GTA TGT 143 

5 Thr Val Arg Leu Arg Ala Tjr Phe Asa Thr Pro Gly Leu Pro Va I Cjs 

35 40 45 

CAA GAC CAC CTA GAG TTC TGG GAA GCG GTC TTC ACA GGT CTC ACA CAC 191 

io Gin Asp His Leo Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr Bis 

50 55 60 

ATT GAT GCC CAC TTC CTC TCC CAG ACG AAG CAA GGA GGA GAC AAC TTT 239 

is He Asp Ala His Phe Leo Ser Gin Thr Lys Gin Gly Gly Asp Aso Phe 

65 70 75 

GCG TAT CTA ACG GCC TAC CAG GCC ACA GTA TGC GCC AGG GCA AAG GCC 287 

20 Ala Tyr Leu Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala 

80 85 90 95 

CCC CCT CCT TCG TGG GAC GTG ATG TGG AAG TGT CTA ATC AGG CTC AAA 335 

25 Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu He Arg Leu Lys 

100 105 110 

CCT ACA TTG ACT GGT CCT ACC CCC CTC CTG TAC CGC TTG GGT GCC GTG 383 

30 Pro Thr Leu Thr Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val 

115 120 125 

ACT AAC GAG GTT ACC CTG ACG CAC CCC GTG. ACG AAA TAT ATC GCC ACG T 432 

35 Thr Aso Glu Val Thr Leu Thr His Pro Val. Thr Lys Tyr He Ala Thr 

130 135 140 
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SEQ ID NO: 14 

SEQUENCE LENGTH: 369 base pairs 
SEQUENCE TYPE: doc le ic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

ATG GGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACT AAC . 48 
Met Gly Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5-10 15 

CGT CGC CCA CAA GAC GTT AAG TTT CCG GGC GGC GGC CAG ATC GTT GGC 96 
Arg Arg Pro Gin Asp Val Lys Pbe Pro Gly Gly Gly GId lie Val Gly 
20 25 30 
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GGA GTA TAC TTG TTG CCG CGC AGG GGC CCC AGA TTG CGT GTG CGC GCG 
Gly Yal Tyr Leo leu Pro Arg Arg 6 1 y Pro Arg Leo C I y Val Arg Ah 
35 40 45 



144 



10 



15 



ACA AGG AAG ACT TCG AAG CGG TCC CAG CCA CGT GGG GGG CGC CGG CCC 192 

Thr Arg Lys Th r Ser Lys Arg Ser Gin Pro Arg Gly Gly Arg Arg Pro 

50 55 60 

ATC CCT AAA GAT CGG CGC TCC ACT GGC AAG TCC TGG GGG AAA CCA GGA 240 

He Pro Lis Asp Arg Arg Ser Tbr Gly Lys Ser Trp Gly lys Pro Gly 

65 70 75 80 



20 



TAC CCC TGG CCC CTA TAT GGG AAT GAG GGA CTC GGC TGG GCA GGG TGG 
Tyr Pro Trp Pro Leo Tyr Gly Asd Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 



288 



25 



CTT CTG TCC CCC CGA GGT TCC CGT CCC TCT TGG GGC CCC ACT GAC CCC 
Leo Leo Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Tor Asp Pro 
100 I05 HO 



336 



30 



CGG CAT AGG TCG CGC AAT GTG GGT AAG GTC ATC 
Arg His Arg Ser Arg As n Val Gly Lys Val He 
1 1 5 120 



369 



55 
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SEQ ID NO! 15 

SEQUENCE LENGTH: 932 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

CG CGC AAC TTG GGT AAG GTC ATC GAT ACC CTC ACA TGC GGC TTC GCC 
Arg As d Leu Gly Lys Val He Asp Thr Leo Thr Cys Gly Pfae Ala 
15 10 15 

GAC CTC ATG GGG TAC ATT CCG CTT GTC GGC GCC CCC CTA GGG GGT GCT 
Asp Leo Met Gly Tyr He Pro Leo Val Gly Ala Pro Leo Gly Gly Ala 
20 25 30 

GCC AGG GCC CTG GCA CAT GGT GTC CGG GTT CTG GAG GAC GGC GTG AAC 
Ala Arg Ala Leo Ala His Gly Val Arg Val Leo Glu Asp Gly Val As n 
35 40 45 

TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG 
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Pbe Ser lie Phe Leo Leo 
50 55 60 

GCT TTG CTG TCC TGT TTG ACC ATC CCA GCT. TCC GCT TAT GAG GTG CGC 
Ala Leo Leu Ser Cys Leu Thr lie Pro Ala. Ser Ala Tyr Glu Val Arg 
65 70 75 

AAC GTA TCC GGG ATA TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA ACT 
Aso Val Ser Gly He Tyr Bis Val Thr Asa Asp Cys Ser Asa Ser Ser 
80 85 90 95 

ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAT ACC CCC GGG TGC GTG 
lie Val Tyr Glo Ala Ala Asp Met He Met Bis Thr Pro Gly Cys Val 
100 105 110 

CCC TGC GTT CGG GAG AAC AAC TCC TCC CGT TGC TGG GCA GCG CTC ACT 
Pro Cys Val Arg GtarAsn Asi Ser Ser Arg Cys Trp Ala Ala Leu Thr 
115 120 125 

CCC ACG TTA GCG GCC AGG AAC ACC AGC GTC CCC ACT ACG ACA ATA CGA 
Pro Thr Leu Ala Ala Arg Asn Thr Ser Val Pro Thr Thr Thr He Arg 
130 135 140 
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CGG CAT GTC GAT TTG CTC GTT GGG GCG GCT CCT TTC TGC TCC GCT ATG 
A r g His Vai Asp Lea Leu Vil C 1 y Ala Ala Ala Pb e Cys Ser Ala Met 
145 ISO 155 

TAC GTG GGG GAT CTC TGT GGA TCT GTC TTC CTC GTT TCC CAG CTG TTC 
Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Lea Va I Ser Gin Leu Ph e 
160 165 170 175 

ACT TTC TCA CCT CGT CGG CAT GAG ACA GTA CAG GAC TGC AAC TGC TCA 
Th r Phe Ser Pro Ar g A r g Bis Glo Tor Ta I Gin Asp Cys Asa Cys Ser 
180 185 190 

ATC TAT CCC GGC CAC TTG ACA GGT CAT CGC ATG GCT TGG GAT ATG ATG 
He Tyr Pro Gly His Leo Thr Gly His Arg Met Ala Trp Asp Met Met 
195 200 205 

ATG AAC TGG TCA CCT ACA ACA GCC CTA GTG GTG TCG CAT CTA CTC CGG 
Met Asa Trp Ser Pro Th r Tor Ala Leu Val Val Ser Bis Leu Leu Arg 
210 215 220 

ATC CCA CAA GCT GTC ATG GAC ATG GTG GCG GGG GCT CAC TGG GGA GTC 
He Pro Glo Ala Val Met Asp Met Val Ala Gly Ala Bis Trp Gly Val 
225 230 235 

CTA GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT 
Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
240 245 250 255 

TTG ATT GTG ATG CTA CTC TTC GCC GGC GTT GAC GGG ACC ACC TAT GTG 
Leu He Val Met Leo Leo Phe Ala Gly Val Asp Gly Th r Tkr Tyr Val 
260 265 270 

ACA GGG GGG ACG ACA GGC CGC ACC ACC AGC TCG TTC GCA TCC CTC TTT 
Thr Gly Gly Tlr Tkr Gly Arg Thr Tkr Ser Ser Phe Ala Ser Leo Phe 
275 280 285 

ACA CTT GGG TCG -CAT CAG AAG GTC CAG CTT ATA AAT ACC AAT GGC AGC 
Thr Leo Gly Ser His Glo Lys Val Glo Leo He Aso Tkr Aso Gly Ser 
290 295 300 

TGG CAC ATC AAC AGG ACC GCC 
Trp Bis lie Aso Arg Thr Ala 
305 310 
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SEQ ID NO! 16 

SEQUENCE LENGTH: 559 base pairs 
SEQUENCE TYPE: nuc 1 e i c acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to geoomic RNA 

CGC CGG TAT GAG ACG GCG CAA GAC TGC AAT TGC TCA CTC TAT CCC GGT 48 
Arg Arg Tyr Glo Tbr Ala Glo Asp Cys Asn Cys Ser Leu Ty r Fro G I y 
1 5 10 15 

CAC GTA TCT GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCA 96 
His Yal Ser Gly His Arg Met Ah Trp Asp Met Met Met Asa Trp Ser 
20 25 30 

CCT ACA ACG CCC CTA GTG GTA TCG CAG CTA CTC CGG ATC CCA CAA CCC 144 
Pro Thr Tbr Ala Leo Val Yal Ser Gin Leo Leu Arg lie Pro Gin Ala 
35 40 45 

GTC GTG GAC ATG GTG GCG GGG GCC CAC TGG GGA GTC CTA GCG CGC CTT 192 
Yal Val Asp Met Yal Ala Gly Ala His Trp Gly Val Leo Ala Gly Leo 
50 55 60 

GCC TAC TAT TCC ATG GTG GCG AAC TGG GCT AAG GTC TTG GTT GTG ATG 240 
Ala Tyr Tyr Ser Met Yal Ala Asn Trp Ala Lys Val Leo Yal Val Met 
65 70 75 80 

CTA CTC TTT GCC GGC GTT GAC GAC GGG AAG ACC ACC GTG ACG GGG GGG 288 
Leo Leo Pbe Ala Gly Val Asp Asp Gly Lys Tbr Tbr Val Thr Gly Gly 
85 90 95 

AGC GCA GCC TTC CAG TCC AGG AAG TTA GTG TCC TTC TTC TCA CCA GGG 336 
Ser Ala Ala Pbe Gin Ser Arg Lys Leo Val Ser Pbe Pbe Ser Pro Gly 
100 105 110 

CCG AAA CAA AAT ATC CAG CTT GAT AAC ACC AAC GGC AGC TGG CAC ATC 384 
Pro Lys Glo Asn 14* Glo Leo Asp Asn Thr Aso Gly Ser Trp His He 
115 120 125 

AAC AGG ACT GCC CTC AAT TGC AAT GAC TCC CTC CAA ACT GGG TTC ATC 432 
Aso Arg Tbr Ala Leu Asn Cys Aso Asp Ser Leo Glo Thr Gly Phe He 
1 30 1 35 HO 
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GCT GCG CTG TTC TAC GCG CAC AAG TTC AAT TCG TCC GGA TGC CTA GAG 

Ah Ala Leu Phe Tyr Ala His Lys Phe Aso Ser Ser G 1 y Cys Leu Glu 

145 150 155 160 

CGC ATG GCC AGC TGC CGC CCC ATT GAC AAG TTC GCG CAG GGG TGG GGT 

Arg Met Ala Ser Cys A r g Pro He Asp Lys Phe Ala Gin Gly Trp Gly 

165 170 175 

CCC ATC ACT CAC GAT ACG CCT AAG ATC CCG G 

Pro He Thr Bis Asp Thr Pro Lys lie Pro 

180 185 



SEQ ID NO: 17 

SEQUENCE LENGTH: 276 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 

GA CAC CGT ATG GCA TGG GAC ATG ATG ATG AAC TGG TCG CCC ACG GCT 
Bis Arg Met Ala Trp Asp Met Met Met Asa Trp Ser Pro Tb r Ala 
1 5 10 15 

ACC ATG ATT CTG GCG TAT GTG ATG CGC ATC. CCC GAG GTC GTC ATG GAC 
Thr Met He Leo Ah Tyr Val Met Arg He Pro Glu Val Va 1 Met Asp 
20 25 30 

ATC ATT GGC GGG GCT CAC TGG GGC GTC ATG TTC GGCTTG GGC TAT TTT 
lie He Gly Gly Ala Bis Trp Gly Val Met Phe Gly Leu Gly Tyr Phe 
35 40 45 

TCT ATG CAG GGG GCT TGG GCA AAA GTC GTT GTC ATC CTT CTG CTG GCC 
Ser Met Gin Gly Ah Trp Ah Lys Val Val Val He Leu Leu Leu Ah 
50 55 60 

GCT GGG GTG GAT GCG ACT ACC CTC AGC GTT GGG GGC TCT GCC GCG CAC 
Ala Gly Val Asp Ah Thr Thr Leu Ser Val Gly Gly Ser Ah Ah Bis 
65 70 75 

ACC ACC GGC GGC CTT GTC GGC TTG TTC AAG CCT GGC G 
Tbr Tbr Gly Gly Leu Val Gly Leu Pbe Lys Pro Gly 
80 85 90 
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SEQ ID NO: 18 

SEQUENCE LENGTH: 742 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 

MOLECULE TYPE: cDNA to genomic RNA 



CG CTT GTC GGC GCC CCC CTA GGG GGT GCT GCC AGG GCC CTG GCA CAT 

Leu Val G I y Ala Pro Lea G I y Gly Ala Ala Arg Ala Leu Ala His 

1 5 10 15 

GGT GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA ACA GGG AAT TTG 

Glj Yal Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Tbr Gly Asn Leu 

20 25 30 

CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT TTG 

Pro Gly Cys Ser Ph e Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu 

35 40 45 

ACC ATC CCA GCT TCC GCT TAT GAG GTG CGC AAC GTA TCC GGG ATA TAC 

Thr .He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly He Tyr 

50 55 60 

CAT GTC ACG AAC GAC TGC TCC AAC TCA AGT ATT GTG TAT GAG GCA GCG 

His Val Thr Asn Asp Cys Ser Asn Ser Ser. He Val Tyr Glu Ala Ala 

65 70 75 

GAC ATG ATC ATG CAT ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG AAC 

Asp Met He Met His Thr Pro Gly Cys * Va 1 Pro Cys Val Arg Glo Asn 

80 85 90 95 

AAC TCC TCC CGT TGC TGG GCA GCG CTC ACT CCC ACG HA GCG GCC AGG 

Asn Ser Ser Arg Cys Trp Ala Ala Leu Thr Pro Thr Leu Ala Ala Arg 

100 105 110 

AAC ACC AGC GTC CCC ACT ACG ACA ATA CGA CGG CAT GTC GAT TTG CTC 

Asn Thr Ser Val Pro^Thr Thr Thr He Arg Arg His Val Asp Leu Leu 

115 120 125 



GTT GGG GCG GCT GCT TTC TGC TCC GCT ATG TAC GTG GGG GAT CTC TGT 
Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu Cys 
130 135 140 
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GGA TCT GTC TTC CTC GTT TCC CAG CTG TTC ACT TTC TCA CCT CGT CGG 

Gly Ser Val Phe Leu Val Ser Glo Lea Phe Thr Phe Ser Pro Arg Arg 

145 150 155 

CAT GAG ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAC TTG 

His Gin Thr Val Gin Asp Cys A so Cys Ser lie Tyr Pro Gly Bis Leu 

160 165 170 175 

ACA GGT CAT CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCA CCT ACA 

Thr Gly His Arg Met Ah Trp Asp Met Met Met As d Trp Ser Pro Thr 

180 185 190 

ACA GCC CTA GTG GTG TCG CAT CTA CTC CGG ATC CCA CAA GCT GTC ATG 

Thr Ala Leo Val Val Ser His Leu Leo Arg He Pro Gin Ala Val Met 

195 200 205 

GAC ATG GTG GCG GGG GCC CAC TGG GGA GTC CTA GCG GGC CTT GCC TAC 

Asp Met Val Ala Gly Ala His Trp Gly Val Leo Ala Gly Leo Ala Tyr 

210 215 220 

TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA CTC 

Tyr Ser Met Val Gly A so Trp Ala Lys Val Leo He Val Met Leu Leu 

225 230 235 

TTC GCC GGC GTT GAC GGG ACC AC 

Phe Ala Gly Val Asp Gly Thr 
240 245 



SEQ ID NO: 19 

SEQUENCE LENGTH : 20 base pairs 
SEQUENCE TYPE: noc le ic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

GGATACACCG GTGACTTTGA 



SEQ ID NO: 20 

SEQUENCE LENGTH: 20 base pairs 
SEQUENCE TYPE: node i c acid 
STRANDEDNESS:s ingle 
TOPOLOGY: linear 
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TGCATGCACG TGGCGATGTA 20 
Sequence Listing 



SEQ ID NO: 2 1 

SEQUENCE LENGTH: 20 base pairs 
SEQUENCE TYPElnucIeic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

15 GATGCCCACT TCCTCTCCCA 20 



10 



SEQ ID NO: 22 
20 SEQUENCE LENGTH: 20 base pairs 
SEQUENCE TYPE:nucIeic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

25 

GTCAGGGTAA CCTCGTTGGT 20 



30 

Claims 



(1) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

35 whole or a part of the amino acid sequence represented by the SEQ ID NO. 1 . 

(2) A DNA fragment comprising a base sequence which encodes a non^A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 2. 

(3) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
40 polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

whole or a part of the amino acid sequence represented by the SEQ ID NO. 3. 

(4) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 4. 

45 (5) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 5. 

(6) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

so whole or a part of the amino acid sequence represented by the SEQ ID NO. 6. 

(7) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polype petide from a non-structural protein of thea non-A non-B hepatitis virus, said polypeptide consisting of 
the whole or a part of the amino acid sequence represented by the SEQ ID NO. 7. 

(8) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
55 polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

whole or a part of the amino acid sequence represented by the SEQ ID NO. 8 

(9) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
•polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
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whole or a part of the amino acid sequence represented by the SEQ ID NO. 9. 

(10) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 10. 
5 (1 1 ) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 

polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 11. 

(12) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

w whole or a part of the amino acid sequence represented by the SEQ ID NO. 12. 

(13) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 13. 

(14) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
15 polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 

or a part of the amino acid sequence represented by the SEQ ID NO. 14. 

(1 5) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino acid sequence represented by the SEQ ID NO. 15. 

20 (1 6) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino acid sequence represented by the SEQ ID NO. 16. 

(17) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 

25 or a part of the amino acid sequence represented by the SEQ ID NO. 17. 

(18) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino acid sequence represented by the SEQ ID NO. 18. 

(19) An expression vector containing the DNA fragment according to any one of claims 1 to 18 in a cloning 
30 site downstream of a promoter gene in the vector. 

(20) An expression vector according to claim 19, wherein the vector is a plasmid. 

(21) An expression vector according to claim 20, wherein the vector is plasmid TrpTrpE C11-C21. 

(22) An expression vector according to claim 20, wherein the vector is plasmid Trp*TrpE*C11-7. 

(23) A transformant comprising a host cell transformed with the expression vector according to any one of 
35 claims 19 to 22 

(24) A transformant according to claim 23, wherein the host cell is Escherichia coll 

(25) A process for producing a recombinant non-A non-B hepatitis-specific antigen polypeptide obtained 
by expression of the DNA fragment according to any one of claims 1 to 18, which comprises the following steps 
of: 

40 constructing a replicable expression vector which is capable of expressing said DNA fragment in an 

appropriate host ceil; 

obtaining a transformant by introducing said expression vector into the host cell; 
producing said recombinant polypeptide by culturing said transformant under such conditions that said 
DNA fragment is expressed; and 
4$ recovering the recombinant polypeptide. 

(26) A recombinant non-A non-B hepatitis-specific antigen polypeptide obtained by expressing the DNA 
fragment according to any one of claims 1 to 18. 

(27) A method for amplifying a non-A non-B hepatitis virus gene using sense and/or antisense sequence 
synthesized on the basis of a partial base sequence of the DNA fragment according to any one of daims 1 to 

60 18. 

(28) A single strand DNA sequence for PCR primer represented by SEQ ID NO. 19. 

(29) A single strand DNA sequence for PCR primer represented by SEQ ID NO. 20. 

(30) A single strand DNA sequence for PCR primer represented by SEQ ID NO. 21. 

(31) A single strand DNA sequence for PCR primer represented by SEQ ID NO. 22. 

55 (32) A method for detecting a non-A non-B hepatitis virus gene in a fluid sample to be examined, which 
comprises the following steps of: 

isolating RNA from a fluid sample to be examined; 

synthesizing cDNA by treating the RNA with a reverse transcriptase; 
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subjecting the cDNA to polymerase chain reaction using at least one single strand DNA sequence 
according to any one of claims 28 to 31; and 

detecting an amplified non-A non-B hepatitis virus gene. 

(33) An immunological method for detecting an antibody directed against a non-A non-B hepatitis vims anti- 
5 gen, which comprises the following steps of: 

incubating a fluid sample which may contain an anti-non-A non-B hepatitis virus antibody, in the presence 
of at least one recombinant non-A non-B hepatitis-specific-antigen polypeptide according to claim 26 under 
such conditions that said antibody and said polypeptide are capable of undergoing antigen-antibody reaction; 
and 

10 detecting an antigen-antibody complex. 

(34) Use of the recombinant non-A non-B hepatitis-specific antigen polypeptide according to claim 26 in 
the detection of the non-A non-B hepatitis virus. 

(35) Use of the single strand DNA sequence for PCR primer according to any one of claims 28 to 31 in the 
detection of the non-A non-B hepatitis virus. 
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Fig. 2 



10 20 30 40 50 60 

CACGCCCGGTTTGCCCGTGTGTCAAGACCACCTGGAGTTCTGGGAAGCGGTCTTCACAGG 
ThrProGlyLeuProValCysGlnAspHi sLeuGluPheTrpG luAl aVa IPheTh rG 1 y 



70 80 90 100 110 120 

TCTCACGCACATTGATGCCCACTTCCTCTCCCAGACAAAGCAAGGAGGAGACAACTTCGC 
LeuThrHisIleAspAlaHi sPheLeuSe rG 1 nThrLy sG InGlyGlyAspAsnPheAla 



130 140 150 ISO 170 180 

GTATCTAACGGCCTACCAGGCCACAGTGTGCGCTAGGGGAAAGGCCCCTCCTCCCTCGTG 
Ty rLeuThrAlaTyrGlnAlaThrValCysAlaArgAlaLysAlaProProProSerTrp 



190 200 210 220 230 240 

GGATGTGATGTGGAAATGTCTAGCTAGGCTGAAGCCTACACTAATTGGTCCTACCCCCCT 
AspVa lMe tTrpLy sCy sLeuA 1 aA rgLeuLy sProTh rLeu 1 1 eG 1 yProTh rProLeu 



250 260 270 280 290 300 

CCTGTACCGCTTGGGTGCCGTGACCAACGAGGTTACCCTGACGCACCCCGTGACGAAATA 
LeuTyrArgLeuGlyAlaValThrAsnGluValThrLeuThrHi sProVa IThrLy sTy r 



310 320 330 340 350 360 

CATCGCCACGTGCATGCAAGCTGACCTCGAGATCATGACGAGCACATGGGTCCTAGCAGG 
I leAlaThrCysMe tGlnAlaAspLeuGluI leMe tThrSe rTh rTrpVa lLeuAl aG 1 y 



370 380 390 400 410 420 

GGGGGTGCTAGCCGCCGTGGCAGCTTACTGCCTGGCAACCGGCTGTGTTTCCATCATCGG 
Gly Va lLeuAl aAl aVa 1 A 1 aAl aTy rCy sLeuAl aThrGl yCy sVa lSe rllelleGly 



430 440 450 460 470 480 

CCGCCTACACCTGAATGATCAAGTGGTTGTGACTCCTGACAAAGAAATCTTATATGAGGC 
ArgLeuHisLeuAsnAspGlnValValValThrProAspLysGluIleLeuTy rGiuAla 



490 500 510 520 530 540 

CTTTGATGAGATGGAAGAATGCGCCTCCAAAGCCGCCCTCATTGAGGAAGGGCAGCGGAT 
PheAspGluMetGlufeluCysAlaSerLysAlaAlaLeuI leGluGluGlyGlnArgMe t 



550 560 570 580 590 600 

GGCGGAGATGCTCAAGTCTAAGATACAAGGCCTCCTACAACAGGCCACAAGACAGGCCCA 
A laG 1 uMe tLeuLy sSe r Ly s I 1 eG 1 nG 1 yLeuLeuG J nG 1 nA I aTh rArgGlnAlaGln 



610 

AGACATACAGCCAGC 
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Fig. 4 



10 20 30 40 50 60 

CTGGTATGAACTTACGCCTQCTGAGACTACGGTGAGACTCCGGGCCTATTTCAACACGCC 
TrpTy rG luLeuTh rProA 1 aGluThrTh rVa 1 Ar gLeuArgA 1 aTy rPheAsnTh rPro 

70 80 90 100 110 120 

CGGCCTGCCTGTGTGTCAAGACCACCTGGAATTCTGGGAGCCGGTCTTCACAGGTCTCAC 
GlyLeuProValCysGlnAspHi sLeuGluPheTrpGluAl aVa lPheTh rG 1 yLe uTh r 

130 140 150 160 170 180 

ACACATCGATGCCCACTTCCTCTCCCAGACGAAGCAAGGAGGAGATAACTTTGCATATTT 
Hi si leAspAlaHisPheLeuSerGlnThfLysGlnGlyGlyAspAsnPheAlaTy rLeu 

190 200 210 220 . 230 240 

AACAGCCTACCAGGCCACAGTCTGCGCTAGGGCAAAGGCTCCCCCTCCTTCGTGGGACGT 
ThrAlaTyrGlnAlaThrValCysAlaArgAlaLysAlaProProProSerTrpAspVal 

250 260 270 280 290 300 

GATGTGGAAGTGTTTGATTAGGCTCAAACCTACACTGACTGGTCCTACCCCCCTCCTGTA 
MetTrpLysCysLeuIleArgLeuLysProThrLeuThrGlyProThrProLeuLeuTyr 

310 320 330 340 350 360 

CCGCTTGGGTGCCGTGACCAACGAGGTTACCCTGACTCACCCCATGACGAAATATATCGC 
ArgLeuGlyAlaValThrAsnGluValThrLeuThrHisProMetThrLysTyrlleAla 

370 380 390 400 410 420 

CACTTGTATGCAAGCTGATCTTGAGATCATGACAAGCACATGGGTCTTGGCGGGGGGGGT 
ThrCysMetGlnAlaAspLeuGluIleMe tThrSerThrTrpValLeuAlaGlyGlyVal 

430 440 450 460 470 480 

GCTAGCCGCTGTGGCAGCTTACTGCCTAGCGACCGGCTGCATTTCCATCATTGGCCGCCT 
LeuAlaAlaValAlaAUTyrCysLeuAlaThrGlyCysIleSer IlelleGlyArgLeu 

490 500 510 520 530 540 

TCACCTGAATGATCGGGTGGTCGTGACCCCTGATAAGGAAATTTTATATGAGGCCTTTGA 
HlsLeuAsnAspArgValValValThrProAspLysGluIleLeuTyrGluAlaPheAsp 

550 560 570 580 590 * GOO 

TGAGATGGAAGAGTGCGCCTCCAAAGCCGCCCTCATTGAGGAAGGGCAGCGGATGGCGGA 
G luMe tGluGluCysAlaSe rLy sAl aAl aLeu I I eG luGloG 1 yG 1 nAr gMe tAlaGlu 



610 620 630 

GATGCTGAAGTCTAAAATACAAGGCCTCTT 
Me tLeuLysSe rLy s 1 1 eG 1 nGl yLeu 
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Fig. 5c 



790 800 810 820 830 840 

GCTGAGACCACAGTGAGACTCCGGGCTTATTTCAACACGCCCGGTTTGCCCGTGTGTCAA 
AiaGluThrThrValArgLeuArgAlaTyrPheAsnThrProGlyLeuProValCysGIn 



850 860 870 880 890 900 

GACCACCTGGAGTTCTGGGAAGCGGTCTTCACAGGTCTCACGCACATTGATGCCCACTTC 
AspHi sLeuGluPheTrpGluAlaVa IPheThrGlyLeuThrHi s 1 1 eAspA laHi sPhe 



910 920 930 940 950 960 

CTCTCCCAGACAAAGCAAGGAGGAGACAACTTCGCGTATCTAACGGCCTACCAGGCCACA 
LeuSe rG InTh rLy sGl nG 1 yG 1 yAspAsnPheAl aTy rLeuTh r A 1 aTy rG InA laThr 



970 980 990 1000 1010 1020 

GTGTGCGCTAGGGCAAAGGCCCCTCCTCCCTCGTGGGATGTGATGTGGAAATGTCTAGCT 
ValCysAlaArgAlaLysAlaProProProSerTrpAspValMetTrpLysCysLeuAla 



1030 1040 1050 1060 1070 1080 

AGGCTGAAGCCTAGACTAATTGGTCCTACCCCCCTCCTGTACCGCTTGGGTGCCGTGACC 
ArgLeuLysProThrLeuI 1 eG 1 yPr oTh rProLe uLeuTy rArgLe uGl yA 1 aVa 1 Th r 



1090 1100 1110 1120 1130 1140 

AACGAGGTTACCCTGACGCACCCCGTGACGAAATACATCGCCACGTGCATGCAAGTGAAC 
AsnGluValThrLeuThrHisProValThrLysTy rl 1 e AlaTh rCy sMe tGlnVa lAsn 



1150 1160 1170 1180 1190 1200 

CTCGAGATCATGACGAGCACATGGGTCCTAGCAGGGGGGGTGCTAGCCGCCGTGGCAGCT 
LeuGluI leMe tThrSe rThrTrpVa lLeuAl aG lyGlyValLeuA 1 aAl aVa 1AUA1 a 



1210 1220 1230 1240 1250 1260 

TACTGCCTGGCAACCGGCTGTGTTTCCATCATCGGCCGCCTACACCTGAATGATCAAGTG 
Ty rCy sLeuAl aTh rGl yCy sVa lSe r 1 1 e I leGlyArgLeuHi sLeuAsnAspG 1 nVa 1 



1270 1280 1290 1300 1310 1320 

GTTGTGACTCCTGACAAAGAAATCTTATATGAGGCCTTTGATGAGATGGAAGAATGCGCC 
ValValThrProAs'pl.ysGluIlcLeuTyrGluAlaPhcAspGluMe tGluG luCysAl a 



1330 1340 1350 1360 J 370 1380 

TCCAAACCCGCCCTCATTGAGGAAGGGCAGCGGATGGCGGAGATGCTCAAGTCTAAGATA 
Se rLy sA 1 aAlaLeu II eG 1 uG 1 uG 1 yG 1 nA r gMe t A 1 aG luMe tLeuLy sSe rLys I le 



1390 1400 1410 1420 

CAAGCCCTCCTACAACAGGCCACAAGACAGGCCCAAGACATACACC 
G InC I yLeuLeuG I nC \ nAlaThrArgG ) nAlaG I nAsp I leCln 
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Fig. 6b 

370 380 390 400 410 420 

GAGCGCGGCTCGTCGTGCTCGCTACCGCTACGCCTCCGGGATCGGTCACCGTGCCACATC 
AlaArgLeuValValLeuAlaThrAlaThrProProGlySerValThrValProHi sPro 

430 440 450 460 470 480 

CCAATATCGAGGAGGTGGCCCTGTCCACCACTGGAGAGATTCCCTTCTACGGCAAAGCTA 
Asnl leGluGluValAlaLeuSe rTh rTh rG 1 yG lu I leProPheTy rG lyLy sAl a lie 

490 500 510 520 530 540 

TCCCCATCGAGACAATCAAGGGGGGGAGGCATCTCATCTTCTGCCGTTCCAAGAAGAAGT 
Pro I leGluThrl leLysGlyGlyArgHi sLeuI lePheCy sArgSe rLysLy sLy sCy s 

550 560 570 580 590 600 

GTCACGAGCTCGCTGGAAAGCTGTCAGCCCTCGGAATCAACGCTGTAGCGTACTACCGGG 
AspGluLeuAl aGlyLysLeuSe rAl aLeuGl y I IcAsnAlaVa 1 A 1 aTy rTy rArgG ly 

610 620 630 640 650 660 

GTCTTGATGTATCCGTCATACCGACCAGCGGAGACGTCGTTGTCGTGGCAACAGACGCTC 
LeuAspValSe rVal I leProThrSe rG I yAspValVa lValVa 1 A 1 aThr AspAl a Leu 

670 680 690 700 710 720 

TAATGACGGGCTACACCGGTGACTTTGATTCAGTGATCGACTGCAATACATGTGTCACCC 
Me tTh rG lyTy rThrGl yAspPheAspSe rVa 1 1 leAspCysAsnThrCysVa IThrGIn 

730 740 750 760 770 780 

AGACAGTCGACTTCAGCTTGGACCCTACCTTCACCATTGAGACGACGACCGTGCCTCAAG 
ThrValAspPheSerLeuAspProThrPheThrl leGluThrThrThrValProGlnAsp 

790 800 810 820 830 840 

ACGCGGTGTCACGCTCGCAGCGGCGAGGCAGAACTGGTAGGGGTAGAGGGGGCATATACA 
AlaValSerArgSe^lnArgArgGlyArgThrGlyArgGlyArgGlyGlyl leTyrArg 

850 

GGTTTGTGACTCCAG 
PheValThrPro 
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Fig. 7 



10 20 30 40 50 60 

GACGAGCTCGCCGCAAAGCTGTCAGGCCTCGGAGTCAATGCTGTGGCATACTACCGGGGT 
AspGluLeuAlaAlaLysLeuSerGlyLeuGlyValAsnAlaValAUTyrTyrArgGly 

70 80 90 100 110 120 

CTCGATGTGTCTGTCATACCGACGAGCGGGGACGTCGTTGTTGTGGCAACAGACGCTCTA 
LeuAspValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 

130 140 150 160 170 180 

ATGACGGGCTATACCGGCGACTTTGACTCGGTGATCGACTGCAATACATGTGTCACCCAA 
MetThrGlyTyrThrGlyAspPheAspSerVallleAspCysAsnThrCysValThrGln 

190 200 210 220 230 240 

ACAGTCGATTTCAGCTTGGACCCTACTTTCACCATTGAGACGACGACCGTGCCCCAAGAC 
ThrValAspPheSerLeuAspProThrPheThrlleGluThrThrThrValProGlnAsp 

250 260 270 280 290 300 

GCGGTGTCGCGCTCGCAGCGGCGAGGCAGGACTGGTAGGGGCAGGGTGGGCATATACAGG 

AlaValSerArgSerGlnArgArgGlyArgThrGlyArgGlyArgValGly IleTyrArg 



310 

TTTGTGACTCCCGAG 
PheValThrProGlu 
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Fig. 8a 



10 20 30 40 50 60 

GTGATGAGCTCGCCGCAAAGCTCTCAAGCCTCGGACTCAACGCTGTAGCATATTACCGGG 
AspGluLeuAJ aAl aLysLeuSe r Se r LeuG 1 yLeu AsnA laVa 1 A 1 aTy rTy r A r gG 1 y 

70 80 90 100 110 120 

GTCTTGATGTGTCCGTCATACCGACTAGTGGAGACGTCGTTGTCGTGGCAACAGACGCTC 
LeuAspValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 

130 140 150 160 170 180 

TAATGACGGGCTATACCGGCGACTTTGACTCAGTGATCGACTGTAACACATGTGTCACCC 
Me tThrGlyTyrThrGlyAspPheAspSerVallleAspCysAsnThrCysValThrGIn 

190 200 210 220 230 240 

AGACAGTTGATTTCAGCTTGGATCCAACCTTCACCATTGAGACGACGACCGTGCCTCAAG 
ThrValAspPheSerLeuAspProThrPhcThrlleGluThrThrThrValProGlnAsp 

250 260 270 280 290 300 

ACGCGGTGTCGCGCTCGCAGCGGCGAGGTAGGACTGGCAGGGGCAGGGGCGGCATCTATA 
AlaValSerArgSerGlnArgArgGlyArgThrGlyArgGlyArgGlyGlylleTyrArg 

310 320 330 340 350 360 

GGTTTGTGACTCCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTCCTGTGTGAGT 
PheValThrProGlyGluArgProSerGlyMetPheAspSerSerValLeuCysGluCys 

370 380 390 400 410 420 

GTTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACCACGGTTAGGTTGC 
TyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArg 

430 440 450 460 470 480 

GGGCTTACCTAAATACAXCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGG 
AlaTyrLeuAsnThrProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGly 

490 500 510 520 530 540 

GCGTCTTCACAGGCCTCACCCACATAGATGCCCATTTCTTGTCTCAGACTAAGCAGGCAG 

ValPheThrGlyLeuThrHisI leAspAlaHi sPheLeuSerGlnThrLysGlnAlaGl y 



60 



EP 0 468 657 A2 



Fi g.db 



550 560 570 580 590 600 

GAGACAACTTTCCCTACCTGGTGGCATACCAAGCTACAGTGTGCCCCAGGCCTCAGGCTC 
AspAsnPheProTyrLeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaPro 

G10 620 630 640 650 660 

CACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTGAAACCTACGCTGCACG 
ProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGly 

G70 680 690 700 710 720 

GGCCAACACCCCTGCTGTATAGGCTAGGAGCCGTCCAAAATCAGGTCACCCTCACACACC 
ProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluValThrLeuThrHisPro 

730 740 750 760 77,0 780 

CCATAACCAAATTCATCATGGCATGCATGTCGGCTGATCTGGAGGTCGTCACCAGCACCT 
IleThrLysPbcIleMetAlaCysMetSerAlaAspLeuGluValValThrSerThrTrp 

790 800 810 820 830 840 

GGGTGCTGGTGGGCGGAGTCCTTGCAGCTCTGGCCGCATATCGCCTGACAACAGGCAGCG 

ValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrArgLeuThrThrGlySerVal 

850 860 870 880 890 900 

TGGTCATCGTGGGTAGGATCATCTTGTCTGGGAGGCCGGCTGTCATTCCCGACAGGGAAG 
VallleValGlyArgllelleLeuSerGlyArgProAlaVallleProAspArgGluVal 



910 
TCCTTTACCGG 
LeuTyrArg" 
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Fig. 9 



10 20 30 40 50 60 

CGACAACCGTGCCCCAAGACGCGGTGTCGCGCTCACAACGGCGGGGTAGGACAGGTAGGG 
ThrThrValProGlnAspAlaValSerArgSerGlnArgArgGlyArgThrGlyArgGly 

70 80 90 100 110 120 

GCAGGAGAGGCATCTACAGATTTGTGACTCCGGGAGAACGGCCCTCGGGCATGTTCGATT 
ArgArgGlylleTyrArgPheValThrProGlyGluArgProSerGlyMe tPheAspSer 

130 140 150 160 170 180 

CTTCGGTCCTGTGTGAGTGCTATGACGCGGGCTGCGCTTGGATCGAGCTCACGCCCGCCG 
SerValLeuCysGluCysTyrAspAlaGlyCysAUTrpIleGluLeuThrProAUGlu 

190 200 210 220 230 24-0 

AGACCTCAGTTAGGTTGCGGGCTTACCTAAATACACCAGGGTTGCCCGTCTGCCAGGACC 
ThrSerValArgLeuArgAlaTyrLeuAsnThrProGlyLeuProValCysGlnAspHis 

250 260 270 280 290 300 

ACCTGGAATTCTGGGAGAGCGTCTTCACAGGCCTCACCCATATAGATGCCCACTTCTTGT 
LeuGluPheTrpCluSerValPheThrClyLeuThrHislleAspAlaHisPheLeuSer 

310 320 330 340 350 360 

CCCAGACCAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAAGCTACAGTCT 
GlnThrLysGlnAlaGlyAspAsnPheProTyrLcuValAlaTyrGlnAUThrValCys 

370 380 390 400 4 10 420 

GCGCCAGGGCCCAGGCTCCACCACCATCGTGGGATCAAATGTGGAAGTGTCTCATACGGC 
AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeulleArgLeu 

430 440 450 460 470 480 

TGAAACCTACGCTACACGGGCCAACACCCCTGTTGTATAGGCTGGGAGGCGTCCAAAATG 
LysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGlu 



AGGTCACCC 
ValThr 
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Fig. 10 a 



n 10 20 30 40 50 ou 

GTGGTCTCCTGGGTGCCATCGTGGTCAGCCTAACGGGCCGCGACAAGAACCAGGTCGAGG 
GlyLeuLeuGlyAlal UValYalSerLeuThrGlyArgAspLysAsnGlnValGluG 

70 80 90 100 HO 

GGGAGGTTCAGGTGGTCTCCACCGCAACGCAATCTTTCCTGGCGACCTGCGTCAATGGCGT 

iyGluValGlnValValSerThrAlaThrGlnSerPheLeuAUThrCysValAsnGlyVa 

130 140 150 160 170 

GTGTTGGACCGTCTACCATGGCGCCGGCTCGAAAACCCTGGCCGGCCCGAAGGGTCCAGTC 

icysTrpThrValTyrHisGlyAlaGlySerLysThrLeuAlaGlyProLysGlyProVal 
ion 200 210 220 230 

acccaaItgtacactaatgtggaccaggacctcgtcggctggccggcgccctccgg^ 

ThrGlnMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProSerGlyAlaA 

,c n ogn 270 280 290 

GGTCCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGCT 
rgSerLeuThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAs 

om 'ion 330 340 350 360 

tgtca?tccggtgcg2cggcggggcgatagcagggggagcctgctttccccc^ 

pVallleProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProLeu 

tcctacttgaaggg?tcctcagg?Sgtccactgc?tt^ 

SerTyrLeuLysGlySerSerGlyGlyProLeuLeuCysProSerGlyHislievaiuiyi 
TCTTCCGGGCTGCCGTGTGCACCCGGGGGGTTGCGAAGGCG^ 

lePheArgAlaAlaValCysThrArKGlyYalAlaLysAlaValAspPheValProvaiu! 

S2tatggaaa2?actatgcgg?ctccggtct?cacggatm?^^ 

uSerMetGluThrThrMetArgSerProValPheThrAspAsnSerSerProProAlaYal 

5 c2gcagacatt2caagtggcc2atctgcatg2ccccactgg?agcggc^ 

ProGlnThrPheGlnValAlaHlsLeuHisAlaProThrGlySerGlyLysSerThrLysV 

?gccggctgcatac^agcc2agggataS1aggtactcgwtgaaccc^ 

alProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAl 



610 



CACCTTAGGTTTTGGAGCATATATGTCCAAGGCACATGGTGTCGACCCTAACA^ 

740 750 760 770 780 

GGGGTAAGGACCATCACTACGGGCGCCCCCATTACATACTCCACCTATGGCA 

GlyValArgThrlleThrThrGlyAlaProIleThrTyrSerThrTyrGlyL 



740 I! 0 att Ar»Ax A^Tr.r.ACCTATGGCAAGTTTCTTG 

LysPheLeuA 
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Fig. 10b 



800 810 820 830 840 

CCGACGGTGGTTGCTCCGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAAC 
laAspGlyGlyCysSerGlyGlyAlaTyrAspI lei lei 1 eCysAspGluCysHi sSerTh 

860 870 880 890 900 

TGACTCGACTTCCATTTTGGGCATTGGCACGGTCCTGGACCAAGCGGAGACGGCTGGAGCG 

rAspSerThrSerlleLeuGlyl leGlyThrValLeuAspGl nAlaGluThrAlaGlyAl a 

920 930 940 950 960 970 

CGGCTCGTCGTGCTCGCCACCGCTACGCCTCCAGGATCGGTCACTGTGCCTCATCCCAACA 
ArgLeuValValLeuAlaThrAlaThrProProGlySerValThrYalProHisProAsnl 

980 990 1000 1010 1020 1030 

TCGAGGAGGTGGCCTTGTCCAGCACTGGAGAGATTCCCTTCTATGGCAAAGCCATCCCCAT 
leGluGluValAlaLeuSerSerThrGlyGluI leProPheTyrGlyLysAlal leProI 1 

1040 1050 1060 1070 

TGAGACCATCAAGGGGGGAAGGCATCTCATTTTCTGCCAC 
eG 1 uThr 1 1 eLy s G 1 yG 1 yArgH i s Leu 1 1 ePh eCys His 
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Fig. II 



1 10 20 30 40 50 60 

GTCGACCCCAATATTAGAACTGGGGTAAGGACCATCACCACGGGCGCTCCCATTACGTAT 
ValAspProAsnl UArgThrGlyValArgThrl UThrThrGlyAlaProI leThrTyr 

70 80 90 100 110 

TCTACCTATGGCAAATTCCTTGCCGACGGTGGTTGCTCTGGGGGCGCCTATGACATCATAA 
SerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspI lei lei 

130 140 150 160 170 

TCTGTGATGAGTGCCACTCAACTGACTCGACTTCCATCTTGGGTATCGGCACAGCCCTGGA 

1 eCysAspGl uCysHi sSerThr AspSerThrSerl 1 eLeuGlyl 1 eGlyThrAl aLeuAs 

190 200 210 220 230 

CCAAGCGGAGACGGCTGGAGCACGGCTTGTCGTGCTCGCCACCGCTACGCCTCCAGGGTCG 

pGlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThrProProGlySer 

250 260 270 280 

GTCACCGTGCCGCATCCCAACATCGAGGAGGTAGCCTTGCC 
ValThrValProHisProAsnl leGluGluValAlaLeu 
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Fig. 12 



10 20 30 40 50 60 

GGACAACTCATCTCCCCCGGCGGTACCGCAGACATTCCAGGTGGCCCATCTACACGCTCC 

AspAsnSerSerProProAlaYalProGlnThrPheGlnValAlaHisLeuHisAlaPr 

70 80 90 100 110 

CACTGGCAGCGGCAAGAGCACTAAGGTGCCGGCTGCATATGCAGCCCAAGGGTACAAAGTA 

oThrG 1 ySerG 1 yLysSerThrLys Val Pro A 1 aAl aTyr A 1 aA 1 aG 1 nG 1 yTyrLys Va 1 

130 140 150 160 170 

CTCGTCCTGAACCCGTCCGTTGCCGCCACCTTAAGTTTCGGGGCGTATATGTCCAAGGCAC 

LeuVal LeuAsnProSerVal Al aAl aThrLeuSerPheG 1 yAl aTyrMe t SerLys A 1 aH 

190 200 210 220 230 

ATGGTGTTGACCCTAATATCAGAACTGGGACAAGGACCATCACCACGGGCGCTCCCATCAC 

i sGl yVal AspProAsnl 1 eArgThrGlyThrArgThrl 1 eThrThrGl yAl aProI ] eTh 

250 260 270 280 290 

GTACTCCACCTATGGCAAGTTCCTTGCAGACGGTGGTTGCTCCGGAGGCGCCTATGACATC 

r TyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspI le 

310 320 330 340 350 360 

ATAATATGCGATGAGTGCCACTCAACAGACTCGACTTCCATCTTAGGCATTGGTACGGTCC 

1 1 e 1 1 eCysAspG 1 uCysHi sSerThr AspSerThrSer I 1 eLeuGly 1 1 eGl yThrVal L 

370 380 390 400 410 420 

TGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCAGG 

euAspGl nAl aGl uThr Al aGl yAl aArgLeuYal ValLeuAl aThrAl aThrProProGl 

430 • 440 450 460 470 480 

ATCGGTCACTGTGCCACATCCCAACATCGAGGAGGTGGCCCTGTCCAACACTGGAGAGATT 

yScrValThrValProHisProAsnlleGluGluValAlaLeuSerAsnThrGlyGluI le 

490 500 510 520 530 5.40 

CCCTTCTATGGCAAAGCCATCCCCATTGAGGCCATCAAGGGGGGGAGGCATCTCATTTTCT 

ProPheTyrGlyLysAl al 1 ePro 1 1 eGluAl al 1 eLysGl yGl yArgHi sLeu 1 1 ePheC 

550 560 570 580 590 600 

GCCATTCTAAGAAG^fAGTGTGATGAGCTCGCCACGAAGCTGTCGGCCCTCGGACTCAATGC 

ysH i sSerLysLysLysCys AspGl uLeuA 1 aThrLysLeuSer A 1 aLeuG 1 yLeuAs n A 1 

610 620 630 640 

TGTAGCGTACTACCGGGGTCTTGATGTGTCCG 
aval Al aTyrTyrArgGlyLeuAspValSer 
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Fig. 13 



1 10 20 30 40 50 SO 

CAGGCGAGAGGCCGACAGGGATGTTTGACAGCGTAGTGCTCTGTGAGTGCTATGATGCCG 
GlyGluArgProThrGlyMetPheAspSerYalValLeuCysGluCysTyrAspAlaG 

70 80 90 100 HO 

GGGCCGCCTGGTACGAGCTTACGCCTGCTGAGACTACGGTGAGACTCCGGGCTTATTTCAA 

lyAlaAlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAUTyrPheAs 

130 140 150 160 170 

CACGCCCGGTTTGCCTGTATGTCAAGACCACCTAGAGTTCTGGGAAGCGGTCTTCACAGGT 

nThrProGl yLeuProVal CysGl nAspHi sLeuG 1 uPheTrpG 1 uAl aVa 1 PheThrG 1 y 

190 200 210 220 230 

CTCACACACATTGATGCCCACTTCCTCTCCCAGACGAAGCAAGGAGGAGACAACTTTGCGT 

LeuThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnGlyGlyAspAsnPheAlaT. 

250 260 270 280 290 

ATCTAACGGCCTACCAGGCCACAGTATGCGCCAGGGCAAAGGCCCCCCCTCCTTCGTGGGA 

yrLeuThrAlaTyrGlnAlaThrValCysAlaArgAlaLysAlaProProProSerTrpAs 

310 320 330 340 350 360 

CGTGATGTGGAAGTGTCTAATCAGGCTCAAACCTACATTGACTGGTCCTACCCCCCTCCTG. 

pValMetTrpLysCysLeuIleArgLeuLysProThrLeuThrGlyProThrProLeuLeu 

370 380 390 400 410 420 

TACCGCTTGGGTGCCGTGACTAACGAGGTTACCCTGACGCACCCCGTGACGAAATATATCG 

TyrArgLenGlyAlaValThrAsnGluValThrLeuThrHisProValThrLysTyrlleA 

430 
CCACGT 
laThr 
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Fig. 14 



1 10 20 30 40 50 60 

ATGGGCACGAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACTAACCGTCGCCCACAA 
MetGlyThrAsnProLysProGlnArgLysThrLysArgAsn.ThrAsnArgArgProGln 

70 80 90 100 110 

GACGTTAAGTTTCCGGGCGGCGGCCAGATCGTTGGCGGAGTATACTTGTTGCCGCGCAGGG 
AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArgG 

130 140 150 160 170 

GCCCCAGATTGGGTGTGCGCGCGACAAGGAAGACTTCGAAGCGGTCCCAGCCACGTGGGGG 
lyProArgLeuGlyValArgAlaThrArgLysThrSerLysArgSerGlnProArgGlyGl 

190 200 210 220 230 

GCGCCGGCCCATCCCTAAAGATCGGCGCTCCACTGGCAAGTCCTGGGGGAAACCAGGATAC 
yArgArgPro 1 1 eProLysAspArgArgSerThrGl yLysSerTrpGl yLysProGl yTyr 

250 260 270 280 290 

CCCTGGCCCCTATATGGGAATGAGGGACTCGGCTGGGCAGGGTGGCTTCTGTCCCCCCGAG 
ProTrpProLeuTyrGl yAsnGl uGl yLeuGl yTrpAl aGl yTrpLeuLeuSerPro ArgG 

310 320 330 340 350 360 

GTTCCCGTCCCTCTTGGGGCCCCACTGACCCCCGGCATAGGTCGCGCAATGTGGGTAAGGT 
lySerArgProSerTrpGlyProThrAspProArgHisArgSerArgAsnValGlyLysVa 



CATC 
llle 
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Fig . 15 a 



l io 20 30 40 50 60 

CGCGCAACTTGGGTAAGGTCATCGATACCCTCACATGCGGCTTCGCCGACCTCATGGGGT 
ArgAsnLeuGlyLysVal I leAspThrLeuThrCysGlyPheAl aAspLeuMetGlyT 

70 80 90 100 110 

ACATTCCGCTTGTCGGCGCCCCCCTAGGGGGTGCTGCCAGGGCCCTGGCACATGGTGTCCG 
yrlleProLeuValGlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValAr 

130 140 150 160 170 

GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATC 
gValLeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerl le 

190 200 210 220 .230 

TTCCTCTTGGCTTTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAGGTGCGCAACG 
PheLeuLeuAlaLeuLeuSerCysLeuThrl leProAlaSerAlaTyrGluValArgAsnV 

250 260 270 280 290 

TATCCGGGATATACCATGTCACGAACGACTGCTCCAACTCAAGTATTGTGTATGAGGCAGC 
alSerGlyl leTyrHisValThrAsnAspCysSerAsnSerSerl leValTyrGluAlaAl 

310 320 330 340 350 360 

GGACATGATCATGCATACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAACTCCTCCCGT 

aAspMet 1 1 eMe tHi sThrProGl yCysVal ProCysVal ArgGluAsnAsnSerSerArg 

370 380 390 400 410 420 

TGCTGGGCAGCGCTCACTCCCACGTTAGCGGCCAGGAACACCAGCGTCCCCACTACGACAA 

CysTrpAl aA 1 aLeuThrProThrLeuAl aAl aArgAsnThrSerValProThrThrThr I 

430 440 450 460 470 480 

TACGACGGCATGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTGCTCCGCTATGTACGTGGG 

1 eArgArgHi sVal AspLeuLeuVal Gl yA 1 aAl aAl aPheCysSer Al aMe tTyrVal G 1 
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Fig. 15 b 



490 500 510 520 530 540 

GGATCTCTGTGGATCTGTCTTCCTCGTTTCCCAGCTGTTCACTTTCTCACCTCGTCGGCAT 

yAspLeuCysGlySerValPheLeuValSerGlnLeuPheThrPheSerProArgArgHis 

550 560 570 580 590 600 

GAGACAGTACAGGACTGCAACTGCTCAATCTATCCCGGCCACTTGACAGGTCATCGCATGG 
GluThrValGlnAspCysAsnCysSerl leTyrProGlyHisLeuThrGlyHisArgMetA 

610 620 630 640 650 660 

CTTGGGATATGATGATGAACTGGTCACCTACAACAGCCCTAGTGGTGTCGCATCTACTCCG 
laTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValSerHisLeuLeuAr 

680 690 700 710 720 

GATCCCACAAGCTGTCATGGACATGGTGGCGGGGGCTCACTGGGGAGTCCTAGCGGGCCTC 
gl leProGlnAlaValMetAspMetValAlaGlyAlaHisTrpGlyValLeuAlaGlyLeu 

740 750 760 770 780 

GCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGATTGTGATGCTACTCTTCGCCG 

AlaTyrTyrSerMetValGlyAsnTrpAlaLysValLeuI 1 eValMe tLeuLeuPheAl aG 

800 810 820 830 840 

GCGTTGACGGGACCACCTATGTGACAGGGGGGACGACAGGCCGCACCACCAGCTCGTTCGC 
lyValAspGlyThrThrTyrValThrGlyGlyThrThrGlyArgThrThrSerSerPheAl 

860 870 880 890 900 

ATCCCTCTTTACACTTGGGTCGCATCAGAAGGTCCAGCTTATAAATACCAATGGCAGCTGG 

aSerLeuPheThrLeuGlySerHisGlnLysValGlnLeuIleAsnThrAsnGlySerTrp 

920 930 
CACATCAACAGGACCGCC 
Hi si leAsnArgThcAla 
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Fig. 16 



, in 20 30 40 50 60 

CGCCGGTATGAGACGGCGCAAGACTGCAATTGCTCACTCTATCCCGGTCACGTATCTGGT 

ArgArgTyrGl uThrAl aGl nAspCysAsnCysSerLeuTyrProGlyHi sVal SerGl y 

70 80 90 100 110 

CACCGCATGGCTTGGGATATGATGATGAACTGGTCACCTACAACGGCCCTAGTGGTATCGC 
HisArgMetAlaTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValSerG 

130 140 150 160 170 

AGCTACTCCGGATCCCACAAGCCGTCGTGGACATGGTGGCGGGGGCCCACTGGGGAGTCCT 

InLeuLeuArglleProGlnAlaValValAspMetValAlaGlyAlaHisTrpGlyValLe 

190 200 210 220 230 

AGCGGGCCTTGCCTACTATTCCATGGTGGCGAACTGGGCTAAGGTCTTGGTTGTGATGCTA 

uAlaGlyLeuAlaTyrTyrSerMetValAlaAsnTrpAlaLysValLeuValValMetLeu 

250 260 270 280 290 

CTCTTTGCCGGCGTTGACGACGGGAAGACCACCGTGACGGGGGGGAGCGCAGCCTTCCAGT 

LeuPheAlaGlyValAspAspGlyLysThrThrValThrGlyGlySerAlaAlaPheGlnS 

q i o 320 330 340 350 360 

CCAGGAAGTTAGTGTCCTTCTTCTCACCAGGGCCGAAACAAAATATCCAGCTTGATAACAC 

370 380 390 400 410 420 



CAACGGCAGCTGGCACATCAACAGGACTGCCCTGAATTGCAATGACT 
rAsnGlySerTrpHisIleAsnArgThrAlaLeuAsnCysAsnAspSerLeuGlnThrGly 

TTCATCGCTGCGCTGTTCTACGCGCACAAGTTCAATTCGTCCGG^ 
PhelleAlaAlaLeuPheTyrAlaHisLysPheAsnSerSerGlyCysLeuGluArgMetA 

c nrt 510 520 530 540 

CCAGCTGCCGCCCCATTGACAAGTTCGCGCAGGGGTGGGGTCCC^ 
TaSerCysArgPror^AspLysPheAlaGlnGlyTrpGlyProIleThrHisAspThrPr 

550 
TAAGATCCCGG 
oLysI lePro 
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Fig. 17 



! in 20 30 40 50 60 

GACACCGTATGGCATGGGACATGATGATGAACTGGTCGCCCACGGCTACCATGATTCTGG 
HisArgMetAlaTrpAspMetMetMetAsnTrpSerProThrAlaThrMetlleLeuA 

70 80 90 100 110 

CGTATGTGATGCGCATCCCCGAGGTCGTCATGGACATCATTGGCGGGGCTCACTGGGGCGT 

laTyrValMetArglleProGluValValMetAspIlelleGlyGlyAlaHisTrpGlyVa 

130 140 150 160 170 

CATGTTCGGCTTGGGCTATTTTTCTATGCAGGGGGCTTGGGCAAAAGTCGTTGTCATCCTT 
lMetPheGlyLeuGlyTyrPheSerMetGlnGlyAlaTrpAlaLysValValVallleLeu 

190 200 210 220 230 

CTGCTGGCCGCTGGGGTGGATGCGACTACCCTCAGCGTTGGGGGCTCTGCCGCGCACACCA 

LeuLeuAlaAlaGlyValAspAlaThrThrLeuSerValGlyGlySerAlaAlaHisThrT 

250 260 270 

CCGGCGGCCTTGTCGGCTTGTTCAAGCCTGGCG 
hrGlyGlyLeuValGlyLeuPheLysProGly 
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Fig. 18 



, in 20 30 40 50 60 

CGCTTGTCGGCGCCCCCCTAGGGGGTGCTGCCAGGGCCCTGGCACATGGTGTCCGGGTTC 
LeuValGlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValL 

70 80 90 100 110 

TGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATCTTCCT 
euGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePheLe 

130 140 150 160 170 

CTTGGCTTTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAGGTGCGCAACGTATCC 

uLeuAlaLeuLeuSerCysLeuThrlleProAlaSerAlaTyrGluValArgAsnValSer 

ion 200 210 220 230 

GGGATATACCATGTCACGAACGACTGCTCCAACTCAAGTATTGTGTATGAGGCAGCGGACA 

Gl yl 1 eTyrHi sVal ThrAsnAspCysSerAsnSerSer 1 1 eValTyrGl uAl aAl aAspM 

,c fl 260 270 280 290 

TGATCATGCATACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAACTCCTCCCGTTGCTG 

etlleMetHisThrProGlyCysValProCysValArgGUAsnAsnSerSerArgCysTr 

Tin 320 330 340 350 360 

GGCAGCGCTCACTCCCACGTTAGCGGCCAGGAACACCAGCGTCCCCACTACGAC^ 
PA1 aAl aLeuThrProThrLeuAl aAl aArgAsnThrSerVal ProThrThrThr I l eArg 

-*<»n 400 410 420 

cggcStgtcgatttgctcgttggggcggctgctttctgctccgctatgtacgtgggggatc 
tctgtggatctgtcttcctcgtttcccagctgttcactttctcacct^ 

euCysGlySerValPheLeuValSerGlnLeuPheThrPheSerProArgArgHisGluTh 
AGTACAGGACTGCAACTGCTCAATCTATCCCGGCCACTTG^ 

rValGlnAspCysAsnCysSerlleTyrProGlyHisLeuThrGlyHisArgMetAiairp 

550 560 570 580 590 __600 

GATATC 
AspMel 

Crt cqa 700 710 720 

CTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGATTG 

740 
GACGGGACCAC 
AspGlyThr 



GATATGATGATGAACTGGTCACCTACAACAGCCCTAGTGGTGTCGCAT^ 
AspMetMetMetAsnTrpSerProThrThrAlaLeuValValSerHisLeuLeuArgn 
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Fig. 19 
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Fig. 21 
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Fig. 22 
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Fig. 23 
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@ This invention relates to a DNA fragment 
encoding a constitutive polypeptide of struc- 
tural protein in a non-A non-B hepatitis type 
virus, which is obtained by means of genetic 
engineering from non-A non-B type hepatitis 
vims RNA isolated directly from blood plasma 
of non-A non-B hepatitis patients, to an expres- 
sion vector containing the DNA fragment, to a 
transformant obtained by transforming a host 
with the expression vector and to an expressed 
polypeptide obtained by cutturing the transfor- 
mant and a process for producing the polypep- 
tide. 

Results ->-•■ 

The genetic product obtained by expressing 
the DNA in the transformant is useful as a non-A 
non-B hepatitis specific antigen for highly accu- 
rate diagnosis of non-A non-B hepatitis pa- 
tients. 
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